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ABSTRACT 

For  the  problem  of  efficiency-robust  estimation  of  the 
location  parameter  9  of  a  family  of  symmetric  pdf's  f(x-0tA), 
A  e  A  =  {l,...,m},  6  e  0  =  I0|-oo<e<oo  },  the  method  of 
"mixture  models"  of  Blrnbaum  (Ann.  Math.  Statist.  32  (I961) 
622)  Is  applied  to  determine  generalized  Pitman  estimators, 
which  are  shown  to  be  admissible,  with  squared  error  loss 
function,  under  broad  regularity  conditions.   With  increasing 
sample  size,  these  estimators  are  proved  to  be  fully 
efficient  (i.e.,  asymptotically  equivalent,  for  each  value 
of  A,  to  the  maximum  likelihood  estimator  which  would  be 
appropriate  if  the  true  value  of  A  were  known).   Computationally 
tractable  analogous  estimators  based  on  k  sample  quantlles 
are  defined  in  the  context  of  the  model  representing  their 
asymptotic  normal  distributions.   It  is  shown  that  with 
increasing  k  these  approach  equivalence  to  the  fully 
efficient  estimators  based  on  complete  samples.   Equivalent 
estimators  are  given  also  for  the  case  of  unknown  scale 
parameters.   Efficiency-robust  linear  unbiased  estimators 
based  on  sample  quantlles  are  derived  and  the  optimal  spacing 
of  quantlles  is  discussed. 
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Chapter  1 
INTRODUCTION 

The  term  "robust"  In  the  title  of  this  paper  refers 
to  the  broad  program  for  development  of  efficiency-robust 
estimators  initiated  by  Tukey  (I96O,  I962). 

A  general  method  for  constructing  optimally  efficiency- 
robust  inference  techniques  was  given  by  Birnbaum  (I96I). 
In  the  chapters  that  follow  we  study  the  properties  of 
various  estimators  of  location  obtained  by  this  method. 

We  assume  in  this  approach  that  we  are  given  n 
Independent,  identically  distributed  random  variables 
X^,...,  X  with  pdf  f(x  -  e|A),  where  f  is  continuous  and 
symmetric,  ee@={0|-oo<9<oo},  and  A  e  7\  =  {1,...,  m} 
is  a  "shape"  parameter;  we  are  interested  in  finding  estimators 
that  are  in  some  sense  optimum  uniformly  over  all  shapes  A. 
BlrnbaT;im's  method  yields  estimators  admissible  over  the 
parameter  space  ®  X  A  with  respect  to  some  specified  risk 
function.   (A  more  detailed  discussion  of  his  method  is 
given  in  Section  2.2  and  also  in  Birnbaum  and  Laska  (1967a).) 

In  Chapter  2  the  efficiency-robust  version  of  Pitman's 
(1959)  estimator  of  location,  the  "generalized  Pitman 
estimator,"  is  examined.   The  admissibility  of  this  estimator 
over  @  K  A  was  indicated  by  Birnbaum  (I961).   In  Theorem  2.2 
It  is  shown  that,  under  mild  regularity  conditions,  for  fixed 


n  the  generalized  Pitman  estimator  Is  unbiased,  has  variance 
Independent  of  6   for  every  A  e  A ,  and  Is  admissible  over 
©  X  A  with  variance  as  risk  function. 

The  main  result  of  the  chapter  Is  Theorem  2.6,  In  which 
we  prove  that  under  suitable  regularity  conditions  the 
generalized  Pitman  estimator  Is  asymptotically  fully  efficient 
under  each  shape  A;  that  Is,  It  Is  asymptotically  equivalent, 
under  each  shape,  to  the  maximum  likelihood  estimator 
appropriate  for  that  shape. 

Since  the  Pitman  estimator  Is  In  general  quite  difficult 
to  compute  even  for  a  single  shape,  in  Chapter  3  we  consider 
an  approximate  version. 

Hosteller  (19^6)  proposed  a  method  of  estimation  based 
on  sample  quantiles,  in  which,  given  a  sample  of  size  n, 
only  a  subset  of  size  k  is  used  in  the  construction  of  the 
estimators.   After  the  observations  have  been  ordered,  the 
k  sample  quantiles  x,   ^  (1  =  1,...,  k;  x,   ^  ^^/^    )^ 
to  be  selected  are  determined  by  the  "spacing" 
{A.;  1  =  1,...,  k},  a  set  of  k  real  numbers  satisfying  the 


<  ...  <  A,  <  1,  as  follows:   n. 


relationship  0  <  A-, 

if  nA.  is  an  Integer,  and  n.  =  [nA^]  +  1  otherwise.   We 

will  refer  to  this  method  henceforth  as  that  of  systematic 

statistics. 

A  basic  tool  in  related  investigations  is  Mosteller's 
result  that  the  k  sample  quantiles  have  an  asymptotic  k-variate 


normal  distribution  with  mean  and  covarlance  matrix  of  known 
form. 

What  makes  these  estimators  especially  attractive  Is 
the  fact  that  optimum  properties  are  approached  as  both  n 
and  k  tend  to  oo ,  while  k/n  tends  to  0.   Extensive  studies 
of  some  systematic  estimators  have  been  made  by  Ogawa  (1951) 
and  Sarndal  (I962)  and  In  all  cases  Investigated  numerically, 
high  efficiencies  were  obtained  already  for  quite  small 
values  of  k  (4  <  k  <_  10). 

In  Chapters  3  -  5  we  consider  the  problem  of  constructing 
efficiency-robust  systematic  estimators.   We  assume  that 
k  and  the  spacing  (A.  }  are  given,  and  that  furthermore  the 
spacing  Is  symmetric,  i.e.,  A^  =  1  -  ^i^_i+i*  1  =  !»•••>  ^- 

In  Chapter  3  we  first  define  the  systematic  generalized 
Pitman  estimator  based  on  the  exact  distribution  of  the  k 
selected  order  statistics.   In  the  context  of  the  latter 
model,  we  show  that  It  has  a  niomber  of  optimum  properties; 
but  this  estimator  Is  still  extremely  difficult  to  compute. 

We  turn  next  to  the  systematic  generalized  Pitman  estimator 
based  on  the  asymptotic  k-variate  normal  distribution  of 
the  sample  quantlles;  ease  of  computation  is  an  immediate 
result.   In  Theorem  3.2  we  prove  that  as  the  sample  size  n 
Increases,  this  estimator  is  asymptotically  efficient  among 
all  estimators  based  on  the  k-varlate  normal  model,  uniformly 
over  all  shapes  A.   Following  it  we  have  in  Theorem  3.3  the 


even  stronger  result  that  the  asymptotic  relative  efficiency 
of  this  systematic  generalized  Pitman  estimator  in  the  class 
of  all  estimators  of  9   can  be  made  arbitrarily  close  to 
\mlty  (by  Increasing  k)  uniformly  over  all  shapes  A. 
(This  result  refers  to  efficiency  defined  in  the  context 
of  the  exact  model,  for  full  samples.) 

An  asymptotically  equivalent  estimator  is  given  in 
Section  3.5  for  the  case  of  practical  importance  in  which 
the  scale  parameters  a^j  A  =  1,...,  m,  are  unknown. 

Chapter  4  deals  with  systematic  linear  estimators. 
Using  the  exact  covarlance  matrix  of  the  sample  quantlles, 
we  extend  to  the  systematic  case  the  best  linear  unbiased 
estimators  (BLUE's)  of  Lloyd  (1952)  and,  for  efficiency- 
robustness,  the  admissible  linear  unbiased  estimators 
(ALUE's)  of  La ska  (I962),  obtaining  the  corresponding  best 
systematic  linear  unbiased  estimators  (BSLUE's)  and  admissible 
systematic  linear  unbiased  estimators  (ASLUE's)  of  Q. 

If  instead  of  the  exact  covarlance  matrix  we  consider 
the  covarlance  matrix  of  the  asymptotic  normal  distribution 
of  the  k  sample  quantlles,  we  have  the  asymptotically  best 
systematic  linear  unbiased  estimators  (ABSLUE's)  studied 
by  Ogawa  (I95I),  and  their  efficiency-robust  counterparts, 
the  asymptotically  admissible  systematic  linear  unbiased 
estimators  (AASLUE's).   The  properties  claimed  for  these 
estimators  are  proved,  by  methods  analogous  to  the  cases 


already  given  In  the  literature. 

Next  we  show  the  rather  useful  result  that  the 
"asymptotic"  estimators,  the  ABSLUE's  and  AASLUE's,  are 
asymptotically  equivalent  to  within  0  (n~  )  to  the  BSLUE's 
and  ASLUE's,  respectively.   Finally,  It  follows  from  the  work 
of  Tlschendorf  (1955)  that,  as  k  — >  oo  ,  the  ABSLUE's  are 
also  asymptotically  efficient  in  the  class  of  all  estimators 
of  0. 

In  Chapter  5  we  consider  systematic  linear  estimators 
based  on  a  continuous  weight  function.   Jung  (1955),  among 
others,  obtained  an  asymptotic  version  of  Lloyd's  best 
linear  unbiased  estimator  which  was  generalized  to  the 
efficiency-robust  case  by  Yhap  (I966,  I967).   Sarndal  (I962) 
gave  the  systematic  version  of  Jung's  result  for  an  arbitrary 
spacing  [A.}.   In  Section  5.I  we  derive  Jung's  estimator 
under  weakened  (necessary  and  sufficient)  conditions,  and  in 
Section  5.2  we  show  that  Sarndal 's  result,  obtained  by  him 
by  a  different  procedure,  follows  quite  simply  from  Jung's 
approach. 

If  we  consider  the  "uniform"  spacing  A.  =  l/(k+l), 
1  =  1,...,  k,  the  systematic  weight  function  coincides  with 
that  of  Jung.   Hence,  for  this  particular  spacing,  the  efficiency- 
robust  solution  is  the  same  for  both  the  systematic  and  full- 
sample  model.   For  this  reason  we  give  a  brief  account  in 
Section  5.5  of  Yhap's  estimator.   His  approach  seems 
to  be  adaptable  to  the  case  of  an  arbitrary  spacing  {  A. }, 


and  we  Indicate  how  such  a  solution  would  be  attempted. 

The  reason  that  the  uniform  spacing  Is  of  real  Interest 
In  Chapter  5  Is  that  numerical  studies  have  yielded  good 
results  for  other  than  "optimum"  spaclngs.  -The  problem-area 
of  the  choice  of  k  and  the  spacing  {  A^  }  Is  discussed  in 
Chapter  6.   There  we  give  a  brief  exposition  of  the  method 
of  optimum  spacing  defined  by  Ogawa  (1951)  and  the  "nearly 
optimum"  solution  proposed  by  Sarndal  (I962).   We  then  define 
the  concept  of  an  admissible  spacing  for  the  efficiency- 
robustness  problem  and  set  up  solutions  analogous  to  those 
of  Ogawa  and  Sarndal.   A  third  possible  approach  is  also 
suggested. 

All  of  the  estimators  here  discussed  are  fully  efficiency- 
robust  in  terms  of  the  admissibility  concept  as  defined  In 
this  context  by  Blrnbaiam  (I961).   In  addition,  the  non-linear 
estimators  treated  in  Chapters  2  and  3  have  full  asymptotic 
efficiency  in  the  class  of  all  estimators,  uniformly  over 
all  shapes  A.   Of  course,  since  these  results  are  asymptotic, 
numerical  investigations  are  needed  to  determine  how  fast 
the  various  optimum  properties  approach  their  asymptotic 
values.   In  addition  all  these  estimators,  and  others 
proposed  for  robust  estimation,  need  to  be  compared  niomerlcally 
for  small  values  of  n  and  k,  various  spaclngs  {X^^},  and 
selected  families  of  underlying  distributions. 

Huber  (196^)  defined  a  so-called  (M) -estimator  which 
he  showed  to  be  minimax  with  respect  to  asjrmptotic 


variance  in  the  class  of  all  translation-Invariant  estimators 
for  a  family  of  shapes  C  whose  cdf's  lie  within  a  certain 
"distance"  of  the  normal. 

Concerning  the  relations  between  minimax  estimators  and 
possible  fully  efficient  estimators,  Huber  has  written  (p.  80): 

"At  first  glance,  it  might  seem  absurd  to  look  for 
as5miptotic  minimax  solutions,  since,  asymptotically,  one 
could  do  better  by  estimating  the  true  underlying  distribution 
(see  also  Stein  (1956),  Ha'jek  (I962)).   However,  this 
seems  to  require  an  exorbitant  number  of  observations. 
On  the  other  hand,  it  is  hoped —  and  preliminary  niomerical 
experiments  seem  to  substantiate  this  hope — that  (M) -estimators 
approach  their  asymptotic  behavior  rather  fast,  provided  f   is 
bounded.   So  an  asymptotic  minimax  theory  should  be  useful 
in  those  frequent  cases  where  the  sample  size  is  perhaps 
large  enough  to  indicate  deviations  from  the  assumed  model 
but  not  yet  large  enough  to  establish  their  nature." 

If  we  apply  to  the  problem  considered  by  Huber  the 
systematic  generalized  Pitman  estimator  discussed  in  Chapter  3, 
we  have  to  select  a  finite  subset  C»  of  symmetric  shapes 
in  C.   (We  consider  only  S5rmmetric  shapes;  the  usefulness  and 
even  the  very  meaning  of  th^  problem  of  estimating  location 
of  a  family  including  various  asymmetric  shapes  seems 
questionable,  and  in  any  case  requires  further  clarification.) 
However,  our  estimator  reaches  full  efficiency  as  k  tends 


to  00  for  each  member  of  C,  whereas  this  is  true  only  for 
a  particular  ("least  favorable")  shape  in  the  case  of  the 
(M)-estlmator.   Since  C  may  be  made  arbitrarily  large,  it 
may  be  possible  to  choose  it  so  as  to  obtain  a  very  effective 
representation  of  C. 

Furthermore,  it  seems  not  unreasonable  to  conjecture 
that  the  systematic  gene.ralized  Pitman  estimators,  for  small 
but  carefully  chosen  families  of  shapes,  may  approach  their 
asymptotic  properties  even  for  moderate  sample  sizes,  since 
they  are  functions  just  of  sample  quantiles.   The  latter 
feature,  incidentally,  also  makes  these  estimators  highly 
tractable  computationally,  unlike  some  other  estimators 
proposed  for  efficiency-robustness  (including  Ruber's);  this 
will  facilitate  the  proposed  numerical  investigation  of 
their  properties,  and  their  possible  future  applications. 


Chapter  2 
THE  GENERALIZED  PITMAN  ESTIMATOR 

2.1.   Introduction;   The  Pitman  Estimator 

We  shall  assume  henceforth  that  we  have  a  fixed  number 
n  of  independent.  Identically  distributed  random  variables 
X^,...,  X^  with  probability  density  f(x|e)  =  f(x  -  6), 
\6\    <  00 ,  where  f{x\6)    Is  continuous  in  x  and  9   and  is  a 
symmetric  function  of  x.   Many  of  our  results  hold  without 
the  assumption  of  symmetry,  but  only  the  symmetric  case  is 
of  ultimate  relevance  here.   These  assumptions  on  f  will  be 
referred  to  subsequently  as  Condition  1  or  (CI). 

(The  underlying  probability  model  is  of  course  the 
usual  one:   A  sample  space  9^,  a  cr -algebra  -^  of  subsets  of  5^, 
and  a  family  of  probability  measures  Pg,  0  e  o,  defined 
over  (^,<?5) .   In  our  case  O  =  0  =  R,  and  f(x|0)  is  the  proba- 
bility density  of  Pg  with  respect  to  Lebesgue  measure.) 

In  what  follows  we  will  use  the  simplified  notation 
(x.  signifying  an  observation  on  X.), 

(2.1)  (x-^,...,  x^)  =  X 

and 

TT  f(xje)  =  f(x,0)  . 
i=i   ^ 


Then  if  xf{x)  =  0(1)  as  x  — >  t   oo  and  x  — >  0,  a  possible 
estimator  of  the  location  parameter  0   Is  given  by 


(2.2) 


(x) 


L 


ef(x,e)de 


f(x,0)de 


Pitman  (1959),  who  proposed  this  estimator,  proved  that 
It  (a)  Is  translation-Invariant,  (b)  has  variance  Independent 
of  G,    (c)  Is  unbiased,  and  (d)  has  mlnlm\;mi  variance  among 
all  translation-Invariant  estimators.   Glrshlck  and  Savage 
(1951)  showed  In  addition  to  (a)  -  (d)  that,  given  certain 
regularity  conditions,  ©*(x)  Is  mlnlmax  with  respect  to 
squared  error  as  loss  In  the  class  of  all  estimators. 
Karlln  (I958)  established  Its  admissibility;  an  alternate 
proof  imder  weaker  conditions  was  given  by  Stein  (1959) 
in  the  following  theorem. 


THEOREM  2.1.      If 


(2.3) 


An 


f(x,e) . 


J. 


"f{x,0)de 


/        f(x,e)d0 

J  -co 


/        ef(x,e)d0 
J  -00 
00 

/  f(x,e)d0 


5/2    n 

n  dx.  <  00 


1=1 


10 


then  0*(x)  as  defined  by  (2.2)  Is  admissible.   That  Is,  if 

As 

^{^)    Is  any  estimator  for  which 


Efl[0(x)  -  ef   <   E«[e*(x)  -  ef     for  all  6   , 


then  equality  holds  for  all  0. 

For  future  reference  we  Indicate  here  the  general 
setting  In  which  the  proof  Is  carried  out. 

Let  j^  be  the  ( n-1 ) -dimensional  real  coordinate  space, 
cf  the  a-algebra  of  Borel  subsets  of  Z/,    and  v  the  distribution 
of  Y  defined  by 

/  xf{x)f(x+y, )  ...  f(x+y^  .)dx 

(2.4)       g(y)  =4 1 S=l , 

/  f(x)f(x+y^)  ...  f(x+y^_-L)dx 

where  y  =  (y-,,...,  yn-l^'  ^^'^   X  Is  a  random  variable  on  (R,^). 
Define  the  mapping 


(2.5)  ^1  =  ^2  -  ^ 


2   ^^1 

\-l  =\-\ 

X  =X-L  -  g(Y,,...,  Y^.^)  . 

Then  the  conditional  density  of  X  given  Y  is  given  by 


11 


f(x+g(y))f{x+g(y)+y  )  ...  f(x+g(y)+y   .) 
(2.6)    P(x,y)  =  -^ ^^^^ 


J      f(x)f(x+y-L)  ...  f(x+y^_i) 


dx 


X  Is  the  estimator  0*(x),  and  the  condition  for  admissibility 
(2.2)  reduces  to 


(2.7)  J  dv(y)(  J   x2p(x,y)dx) 


3/2  . 


In  subsequent  sections  we  will  refer  to  (2-3) j  Stein's 
condition  for  the  admissibility  of  the  Pitman  estimator 
(which  implies  its  existence),  simply  as(C2). 


12 


2.2  The  Generalized  Pitman  Estimator 

Blrnbaum  (I961)  has  given  a  general  method  for  constructing 
efficiency-robust  Inference  techniques.   Application  of  this 
method  to  the  Pitman  estimator  is  discussed  in  Blrnbaum  and 
Laska  (1967a). 

A   brief   siAimnary   of  the  method  as   it  pertains  to  ovir 
problem  follows. 

Assume  given  a  family  of  m  densities  f(x  -  0,A), 
A  e  A=[l,...,    m],    satisfying  (CI)  and  (C2).   The  additional 
parameter  here  signifies  that  the  observations  x,,...,  x  may 
have  come  from  any  one  of  the  m  distributions  or  "shapes" 
defined  by  f(x  -  0,A).  0   is  the  unknown  parameter  of  interest, 
and  A  plays  the  role  of  a  nuisance  parameter;  thus  the 
parameter  space  for  this  model  is  Q  =  0  XA- 

For  the  efficiency-robustness  problem,  realistic  assumptions 
Include  continuous  families  of  shapes;  the  present  model  is  a 
simplified  one  whose  ultimate  value  and  possible  limitations 
require  further  investigation.   (Cf.  also  discussion  in 
Blrnbaum  and  Laska  (1967a,  b)). 

Let  T  =  T(x)  be  an  estimator  of  9   and  let  R  =  R(T,0,A) 
denote  any  specified  risk  function,  e.g.  mean  squared  error. 
Admissibility  of  T  with  respect  to  R  in  a  specified  class  of 
estimators  is  defined  in  the  usual  way;  namely,  if  there 
exists  an  estimator  T'  =  T'(x)  ^^  ^^^   given  class  such  that 

R(T',e,A)  <  R(T,e,A),   V  (0,A)  e  O  , 
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then  equality  holds  V(e,A)  e   O- 

Since  we  assume  that  a  smaller  value  of  R  Indicates 

greater  efficiency,  the  object  is  to  find  a  T  that  somehow 

"jointly"  minimizes  R(T,0,A),  (0,A)  e  O-   In  general  no  T 

is  uniformly  best  over  Q. 

Given  a  probability  distribution  G  over  A,    i.e., 
m 
G  =  (g^,...,  g^),  ZZ  Sx  "  ^'  ^X  ^  ^'    ^®  introduce  the 


A- 
so-called  'fixture  model" 


(2.8)  f(x,0,G)  -  f     f(x,e,A)dG(A) 

g^f(x,e,A)  . 


m 


The  risk  function  of  an  estimator  T  under  the  mixture  model 
is  then  given  by 

n 
(2.9)   R(T(x),e,G)  =  /  ^  L(T,0)  TT  f(x.,0,G)dx 
^R^        i=l     ^       ^ 

=  f:gx/^nL(T'^)  TT^  f(-,e,A)dx. 
g^  R(T(x),e,A)  , 


m 


where  L(T,e)  is  the  loss  function  corresponding  to  R, 
We  now  have  the  following 
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LEMMA  2.1.   Any  estimator  T  -  T ( x )  admissible  over  ® 
under  the  mixture  model  Is  also  admissible  In  the  usual 
sense  over  @  K  A . 

Proof:   AssiAme  T(x)  Is  Inadmissible  over  ®  X  A  with 
respect  to  R  In  a  given  class  of  estimators.   Then 

R(T',0,A)  <  R(T,0,A),   V(e,A)e@XA 
and  3  (e',AMe@XA  3 

R(T',0SA')  <  R(T,9',A')  . 

Now  V  0  7^  0' 

m 
R(T',e,G)  =  ^  g,R(T',0,A) 
A=l   ^ 

m 
<   IZ  g.R(T,0,A) 

=  R(T,0,G) 
and 

R(T',0',G)  =  VI  g,R(T',e',A)  +  g.,R(T',e',A') 

m 
"  ^—   g.R(T,0',A) 


=  R(T,0',G)  . 
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But  this  contradicts  the  assiimption  that  T  is  admissible 
under  (2.8). 

It  needs  to  be  emphasized  that  the  "prior"  distribution 
G  employed  in  the  mixture  model  has  no  extra-mathematical 
significance  and  is  selected  in  any  particular  context 
because  it  yields  an  admissible  estimator  with  additional 
desirable  properties. 

Clearly  estimators  admissible  in  the  sense  defined 
above  also  fit  the  description  "fully  efficiency-robust," 
as  discussed  in  Chapter  1. 

Now, for  any  given  G,  computing  the  Pitman  estimator 

for  the  mixture  model  (2.8)  yields  the  generalized  Pitman 

estimator 

/     0f(x,0,G)d0 

•J    -CD 


(2.10)     0*(x) 


CO 

■oo 
G'— '    r  oo 

oo 


/     f(x,0,G)d0 
J  -oo 


m      ,-  oo 

y—  g^  /   0f(x,0,x)d0 

A^  ^^  -00 

m      -  oo 

f(x,0,A)d0 
oo 


/  ~  f(x,0,A)d0 
J  -m 


00 

m   &A 


s 


-,    m       oo  A 

^-^     J~   g,  ■/     f(x,0,j)d0 


(x) 


.1=1   ^  ^  -c 
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A     A 

S^^  =  Sa^-^  ^^  ^  formal  "posterior  probability"  of  shape  A. 
The  following  results  are  Immediate-... 

THEOREM  2.2.   The  generalized  Pitman  estimator  9* 
defined  by_  (2.10) 

(a )  1e_  unbiased 

(b)  has  variance  Independent  of  0  V  A,  and 
(c  )  ls_  admissible  (over  0  X  A  )  • 

Proof;   (a)  and  (b)  follow  from  the  fact  that  the 
regularity  conditions  assumed  for  each  shape  f(x  -  0,A) 
carry  over  to  the  mixture  model  f(x  -  d,G),    so  that  the 
results  established  by  Pitman  (1939)  and  Glrshlck  and 
Savage  (I95I)  for  a  single  shape  are  Immediately  applicable, 

(c)  Is  a  special  case  of  Lemma  2.1.   We  have 

(2.11)      Var(0*|G)  ^  /    ( 0*  -  of  IJ   f(x.,G)dx. 
^  J  R^  1=1   ^     ^ 

A=l   ^^r'^    ^        1=1     ^      1 

m  ^ 

=  Z:g,  Var(0^|A)  . 


Now  assume  6     Is  Inadmissible.   Then  there  exists  d   =  6{x)    such 
that 


(2.12)       Var(0|e,A)  <  Var(e^lA)  ,    V(A,0) 
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with  inequality  strict  for  at  least  one  (A,©),  say  (A',^'). 
Then,  for   9  j^  6^ , 


(2.13)    Yar{e\9,G)    =  y~     g^  Var  (  e  |  e,  A  )  +  g,  ,  Var  (9  |  6,  A' ) 
A?A'   ^  ^ 


g^  Var(0(.|A) 


Var(0g|G) 


and 


(2.14)   Var(e|e',G)  =  y~  g.  Var(  6  |  e « ,  A )  +  g,  ,  Var(  0  |  0 ' ,  A) 
aTa'  ^  '^ 


=  Var(e*|G)  , 

which  contradicts  the  admissibility  of  9  over  ^  (under  the 
mixture  model)  following  from  Theorem  2.1.  This  proves  the 
theorem. 

As  an  example  of  the  generalized  Pitman  estimator, 
consider  the  case  of  m  =  2,  that  is,  2  possible  shapes,  with 
A  -  1,  2  and  G  =  (g,l-g) • 

We  have 

„   g  /  ef(x,e,i)de  +  (i-g)  /  ef(x,e,2)de 

(2.15)    0p  =  — ^^ ■ 

^   g  ,  f(x,e,i)de  +  (i-g)  /  f(x,0,2)d0 

-  pA  +  (l-g)B 

-  gC  +  (l-g)D   • 
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Let 


and 


f(x  -  0,1)  =  J—   exp  {-  1  (x  -  0)2  } 


f(x  -  0,2)  =  ^-  exp  (-  v/^  |x  -  0|  }  , 


the  normal  and  double  exponential  densities  with  imit  variance. 
We  obtain,  for  sample  size  n. 


and 


^2^^(n-l)/2y^       2  ^^-L    1 


C  -  ,,  Jn-l)/2^  ^^Pt-  I  IZ  (^1  -  ^)^] 


(2.r)^^^^=W2y^ 


i-1 


In  the  following  expressions  for  B  and  D  the  undefined  terms 

0  n 

""^^   X.  and  J^    X.  are  to  be  set  equal  to  zero,  x   =  -  oo 
t^  ^    ltK+1   ^ 

and  X   -,  =  +  oo .   We  have,  for  n  odd, 

exp[  /2  (\ X.  -  ^  X  )] 

^  -  i^l   ^   1=^,1+1   ^ 


piT+IT^  3^ 


2j  -  n 


i  [x   +  1 ]  exp[-  ./2  (2j  -  n)x  ] 

I   J   /2(2J-n)  '^ 


[x,^T  + 


■J+^   /2(2j-n) 
and  for  n  even , 


]  exp[ 


-  /2  (2j  -  n)Xj^^]  I  , 
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exp[  /2    iYZ  X.    -  ZI     ^1 


)] 


[X,    + 


J        /2(2j-n) 


2j    -  n 


]    exp[-  V^(2j-n)x.] 


[x,^n    +  ^r-^= ]    exp[-  v/^(2j-n)Xj_^^] 


'J+^       V^(2j-n) 


1  r^2  ..2. 


n/2 


2  2-  +  ^  2 


-  '  i^T.i " 


For  n  odd,  j  j^ 

^        exp[  /2    (TI  X.    -  yZ     X    )] 
^  1  ^     1^1      ^      1^,1+1      ^ 


2j    -  n 


.  {  exp[-  v/2(2j    -  n)x  .]    -   exp[-  y2(2j    -  n)Xj^^]  ] 


and  for  n  even. 


exp[   /2    (XI  X.    -  y~     ^1^] 

1  v^       ±^     ^      l^I+l 

2j    -  n 


Jn+1172~    511 


•   {  exp[-v/^{2j    -  n)Xj]    -   exp[-  /2(2j    -  n)Xj^-L]  } 


n/2  n 

+  -^  f^n  -  ^n^    ^^P^  >/2    (2l  ^i   -  2Z       ^1 

2^       §+1  I  fe     ^   i     n^. 


)] 
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2.3'   Asymptotic  Properties 

If  as  "prior  density"  of  9   we  admit  any  non-negative 
measurable  function,  the  Pitman  estimator  may  be  considered 
as  the  mean  of  the  posterior  distribution  of  6   given  X  for 
the  case  of  the  "generalized  uniform  distribution" 
(I.e.,  Lebesgue  measure)  over  the  real  line.   In  this 
context  It  Is  also  the  so-called  generalized  Bayes  solution, 
with  squared  error  as  loss. 

The  asymptotic  normality  of  the  posterior  distribution 
for  arbitrary  smooth  priors  and  binomial  X  was  proved  by 
Bernstein  (I917)  and  Independently  by  von  Mlses  (I919). 
For  the  special  case  of  a  uniform  prior.  It  had  already 
been  proved  by  Laplace  (I82O),  under  rather  strong  assumptions. 
That  the  Bernsteln-von  Mlses  result  holds  for  general  smooth 
f(x|e)  was  shown  by  Wolfowltz  (1953).   These  and  related 
results  are  discussed  In  the  rigorous  study  of  the  asymptotic 
properties  of  Bayes  solutions  and  maximum  likelihood  estimators 
carried  out  by  LeCam  (1953,  I958).   Additional  work  Includes 
publications  by  Llndley  (I96I,  1965)^  Hartlgan  (I965),  and 
Elfvlng  (1966). 

Prom  the  extensive  literature  on  the  subject  we  will 
make  use  only  of  the  single  result  that,  under  broad  regularity 
conditions,  the  mean  of  the  posterior  distribution — In  our 
case  the  Pitman  estimator  for  a  single  shape — Is  asymptotically 
equivalent  to  the  corresponding  maximum  likelihood  estimator. 
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Specifically,    we  have 


(2.16)       e*(2L)  =  OU)  +  Op(i)  . 


DEFINITION.   Given  a  sequence  of  random  variables 
{^  },  we  write 


(2.17)  ^^  =  Op(rJ 


i.e.,  Uj^l/^v,  is  bounded  in  probability,  if 
Ve  >0,  3m,  N3Vn>N 

We  are  adopting  here  the  general  notation  of  Chernoff  (I956) 

A    sketch  of  (2.l6)  will  be  given  below,  in  the  form 
needed  subsequently.   The  method  involves  expansion  of 
log  f(x,0)  in  Taylor  series  about  0(x_).   Following  Lindley 
(1961)  we  will  consider  only  [6   -   6)    =   0  (n~^/^)  and  will 
integrate  over  the  entire  range  of  6.      It  can  be  shown, 
using,  e.g.,  methods  of  Hart igan  (I965)  and  Elfving  (I966), 
that  the  omitted  integrals  over  the  tail  areas  in  question 
are  negligible. 

PRELIMINARIES.   The  following  conditions  are  assumed 

to  hold  for  the  \mderlying  density  f(x  -  0);  let  9   =   ©^ e  © 

be  the  true  value  of  the  parameter,   (These  conditions  are 

not  all  necessary  in  this  form;  a  more  precise  formulation 

will  be  worked  out  later. ) 
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(C2).   The  Pitman  estimator  exists  and  is  admissible, V  n. 
(C3).   OV^e^)  log  f(x  -  6),  1  1  r  <  4,  exists  V  x  and  6. 
(C4).   |E(''^0,e')l  =  |E0[(aVS0'')  log  f(X  -  e)|g  ^  0,]|  <  00. 
1  <  r  <  4,   V  e,  0'  . 

(C5).  |E(e,0Ml  =  IEq  log  f(x  -  0')1  <  00,  \/e,   e>  . 
(c6).  0  <  i(e,0')  <  00,  V  0,  0'  , 

where 


and 


1(0,0')  =  E0[(V^0)  log  f(X  -0)|0  ^  Q,f 


1(0,0')  =  -  E^^^0,0M  . 


(Here  nl(0  ,0  )  Is  Fisher's  Information.) 

(C7).   The  maximum  likelihood  estimator  0(x)  is  the  unique 
solution  of  the  likelihood  equation  for  n  >  N,  for 
some  N  and  V^;  and  the  MLE  is  asymptotically 
N(0^,  l/nl(0^,0^)). 

When  considering  the  family  of  densities  f(x,0,A), 
A  e  A ,  we  assume  that  in  addition  to  the  above,  the 
following  conditions  hold.   Let  A  -  A^  e  A  be  the  true  value 
of  the  shape  parameter. 
(C5.a).   |E(0,0';A,A')|  =  lE^^^^^Llog  f(X  -  0'|aM]|  <  oo , 

V  (0,A),  (0',AM  e   o  . 
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(C6.a).   0  <  I(e,0';A,A')  <  co ,   V(0,A),  (0',A>)  e   o, 
where 

i(0,0';A,A')  =  E(g^^j[(VS0)  log  f(x  -  e|A')le  ^  g,]^  , 


and 


I{e,e';A,A')  =  -  E(2)(0,e';A,A')  . 


(We  note  that  i^I(  ^q' ^o'^o'^o^  ^^  Fisher's  Information 
for  shape  A^. ) 

(C7.a).   0-v(x),  the  unique  root  of  the  likelihood  equation 

A  — 

determined  by  shape  A,  Is  asymptotically 
N(e^,  l/nI(e^,0^;A^,A^)). 

When  referring  to  the  expressions  defined  above  we  will 
adopt  the  simpler  notation  of  writing  when  appropriate  e.g. 


E(e,0)  =  E(0) 
E^^^0,e)  -  E^^^Q) 
1(0,0)  =.  1(0) 


I(0,0';A,A)  -  I(0,0';A) 
I(0,0;A,A' )  =  I(0;A,A') 
I(0,0;A,A)  =  I(0;A)  , 

and  similarly  for    E( 0, 0' ;A, A> ) . 

(C7)  and  (C7.a)  are  satisfied  by  the  normal,  logistic, 
and  double  exponential  distributions,  but  not,  e.g.,  by  the 
Cauchy.   Under  weakened  assumptions  admitting  the  Cauchy 
distribution,  one  could  allow  multiple  roots  of  the 
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likelihood  equation  such  that  the  probability  of  more  than 
one  root  — >   0  with  n;  this  case  will  not  be  discussed  here. 

We  note,  in  addition,  that  since  f(x  -  6)    belongs  to  the 
translation-parameter  family,  the  functions  in  {C4)-(C7) 
are  actually  independent  of  6   for  arguments  of  the  form 
{0,6),    and  depend  only  on  (0  -  0')  when  the  argument  is 
(0,0'). 

Repeated  use  will  be  made  of  two  theorems  which  we 
state  here  without  proof. 

THEOREM  2.3.   (Wilks  (I963),  P-  105),^^  (X-j^,  X^,...) 
be  a  stochastic  process  depending  on  a  parameter  0  such  that 
for  each  value  of_  0  ln_  some  interval  (0»,0"),  f  (x,,...,  x    ,    ( 
n  :=  1,  2,...,  and_  0  (x,  ,  .  .  . ,  x  ),  n  =  1,  2,...,  are  sequences 
of  random  variables  converging  in  probability  respectively 
to  g  (  0 )  and  0  uniformly  with  respect  to  0  ln_  (  0 ' ,  ^ "  ) ,  where 
g(0)  is  continuous  in  (0>,0").   Then  f^(x-^,  .  .  .,  x^,  0*), 
n  =  1,  2, . . . ,  converges  in  probability  to  g( 0) . 

THEOREM  2.4.   (Slutsky;  e.g.  p.  102  in  Rao  (I965)). 


Let    (X^,Y^),    n  =  1, 

2,...   be. 

a_ 

sequence   of  pairs   of  random 

variables 

Then 

(a) 

x„^x. 

Y„i>0 

=> 

Vn  -^  0 

(b) 

x„^x. 

.„i.c 

— > 
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In  addition  we  will  need  the  following 

LEMMA  2.2.   Assume  f(x  -  0)  satisfies  (C2)-(C7)  and  let 


(2.18)        L^(x|0)  =  2Z^  1°S  f(^i  -  ^)*   1  <  r  <  4, 
^  ~     1=1  se^       ^ 


Then 


(2.19)  L^(xl0(x))  -  Op('^) 


where  ©(jc^)  Is  the  maximum  likelihood  estimator. 

Proof;   By  (C4)  E^^\e^,9)    exists,  so  that 
weak  law  of  large  numbers, 

(2.20)  i  L^(x|e)  —>   E^^^e^,©)  <   00 


By  the  consistency  of  the  maximum  likelihood  estimator,  we 
have 

(2.21)  e(x)  — >  ^o  • 

Application  of  Theorem  2-3  now  yields 

(2.22)  ^-L^{x\e)   I->   E^^^e^)  <  oo, 

i.e.,  ^  • 

L^(x|S)  =  Op(n)  . 

THEOREM  2.5.   Assume  given  a_  sequence  of  independent 
random  variables  X^,  X^,  . .. ,  with  p.d.f.  f(x  -  0)  satisfying 
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(C1)-(C7).   Then  the  Pitman  estimator  may  be  written 

n    00 

/   0f(x,e)d0 
(2.23)  0*(x) 


•oo 


^  00 

/     f(x,0)d0 
J  -oo 


>M   +  Op(i)  . 


Proof;   We  consider  first  the  denominator;  for  simplicity, 
we  write 

(2.24)        L(x|0)  =  log  f(x|e)  =  ^   log  f(x^|0) 
and 


then  expand  L(xle)  In  a  Taylor  series  about  0.      We  obtain 
(2.25)   L(x|e)  =  L(x|0)  +  I  (0  -  e)^  L^{x\e) 

+  i,  (e  -  0)^  L^(x|e)  +  J,  (0  -  0^^)"^  l^(2l|0') 

for  some  0'  in  the  interval  (0-0,0+0).   By  (2.20) 


1 
n   "4 


L„(x|0')  ^>  £^^^0^,0') 


L|^(x|S')  =  Op(n)  , 


27 


so  that   we  have 


(2.26)     f(x|e)  =  f(x|e)  exp{|  (e  -  e)'^  l^CxI©) 

+  i,    (e    -   0)5  L^(xl^)    +  Op(n-^)} 
=   f(x|0)    exp[i   (e    -    6)'^  l.^{x\e)][\   + 


where 


and 


+  i,    (0  -  ey  L^(x|e)  +  OpCn"-") 


i  [t[e)f   exp[6t(0)}] 


0   <   6    <   1 


(2.27)  t(0)    =  i,     (0    -    0)5   L^(2lI^)    +  %(n~^)     • 

(Let  t(e)  =  y;  we  have  substituted  above  e'^^l+y  +  ^-y  e^j 
valid  for  arbitrary  y. )   Calculating  the  order  of  the 
remainder,  we  obtain 

(2.28)  I  [t(0)]2  exp{6t(0)}  =:  Op(n-^)0(exp(n-^/^}) 

=  Op(n-l)  , 

^  '  1/2 

Since  by  assumption  (0  -  0)  =  0  (n  ^  ).   Thus  we  have 

(2.29)  rf(x|0)d0  =  f(xl0)  /  d0  exp{i  (0  -  0)^  1^^{^\Q)] 

•  [1  +  3-j(e  -  0)^  L^(xl^)  +  Op(n~^)]  . 
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Integration  with  respect  to  0  yleldj 


(2.30)   /  f(x|0)de  =  f{x\e)\  _^2t   "ji/2  ^^  ^  ^  ,1)^ 


where  by  (C6),  requiring  a  unique  root  of  the  likelihood 
equation,  Lp(x|0)  <  0. 

Next,  consider  the  numerator  of  (2.23) •   Write 


(2.31)  /  ef(x|e)d0  =  e  i   f(x|0)de  +  /  (e  -  0)f(x|e) 


ae   . 


since  we  have  already  evaluated  the  first  Integral  on  the 
right-hand  side,  only  the  last  term  needs  to  be  examined. 
Completely  analogously  to  the  previous  arguments  we  obtain 

(2.32)       /    f(xl0)d0   =   f(x|e)  j    d0   exp{i  {9   -   of  L2{x|^)} 

•   [(0  -  e)  +  j;   (e  -  e)^  l^(2lI^)  +  Op(n-5/2j] 

.  .(.,e)r_-r  11/2  r^^'f]   . 0p(n-v2)] 

iL^(x|0)i        h-y.Ll{x\e)       P  J 


^2^-1 


,|e)r_:2.    11/2  r    (^-iJ 


Substituting  (2.30),  (2-31),  and  (2.32)  In  (2.23)  yields 
0^(n-^)    .        -, 

(2.33)     e*  =  e  +  — E ^  =  e  +  o^(n  "■)  . 

1+0  (n-1)        P 
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COROLLARY  2.1.   Under  the  assumptions  of  Theorem  2.5 
the  Pitman  estimator  has  asymptotic  distribution 
N(0  ,  l/nl(0^)),  where  0^   ls_  the_  true  value  of  0  and  nl(0^) 
Is  Fisher's  Information.   That  Is,  6*  is_  a_  BAN  estimator. 

Proof;   Since  by  assumption  0  is  a  BAN  estimator,  the 
result  follows  from  part  (b)  of  Theorem  2.4,  with  c  =  0. 

We  now  come  to  the  main  result  of  this  section,  namely, 
that  as  n  — >  oo  the  generalized  Pitman  estimator  is  not  only 
admissible,  but  has  full  asymptotic  efficiency. 

THEOREM  2.6.   Let  f(x,0,A),  A  =  1,  2,...,  m,  be  m 
densities  satisfying  (C1)-(C7),  and  let_  Q   =   ^^   and  A  =  A^ 
be  the  true  parameter  values.   If  V  (0,A)  /   (^o'^o^ 
f(x,0,A)  differs  from  f(x,0  ,A  )  on  more  than  a  set  of 
measure  zero,  then  the  generalized  Pitman  estimator  ^ni^i)   has 
asymptotic  distribution  N(0g,  l/nl{ 0^;Aq) ) ,  where_  I(0^;Aq) 
is  Fisher's  Information  for  shape  A  ,  that  is,  9     ls_  BAN 
uniformly  over  the  respective  shapes. 

COMMENT.   Familiar  examples  show  that  the  maximum  like- 
lihood estimators  for  individual  shapes  are  in  general  BAN 
only  under  a  single  shape.   For  example,  we  have  the  normal 
and  double  exponential  shapes,  for  which  the  maximum  likeli- 
hood estimators  are  the  mean  and  the  median,  respectively, 
neither  of  which  is  uniformly  BAN  over  both  shapes. 
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Proof:   The  form  of  the  generalized  Pitman  estimator 
was  given  by  (2.10).   Substituting  for  the  Integrals  therein 
the  asymptotic  expressions  (2. 30)  -  (2.31),  with  the 
appropriate  addition  of  the  Indices  A,  we  obtain 


oo 

Id© 


g      /  f(x|0,A)( 

^  -co 


(2.3^)         ^G=TZ  ~ 0 

A=l      m  ■   00 

y~  g.  /        r{x\G,j)de 

,1=1        ^  ^  -00 


gf(.i0  ,)r_^22i_ii/2 

m        ^    -    ^       '-L  re   lA)  -J 


^^ iK    +  O^(n-l)], 


-2Tr        "]  1/2        '^  P 


where 


L2(0^|a)    =  L^{x\e^,-K) 


Without  loss  of  generality  assume  that  A   =  1  Is  the  true 
value  of  the  shape  parameter.   Then  (2.34)  can  be  written  as 

(2.35)  e*   =   1  ^ [0^  +  0^^^"^)] 

1  +  f- !£^iVi!r  v^ii/2         "^ 

j=2  g^ru\e^,i)^L^{e  \j)  i 


a^^^lI^.a)  rL^eJi)n,/2 


"fiTj^TiT  LL2(e^|A)  -I 


^   ^  [0^  +  0  (n-^) 


^^^£/(^ViirV^ii/2 

j^  g^f(x|0^,i)L  L^TeTjT)"  J 
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Now  by  the  consistency  of  the  maximum  likelihood  estimator. 
(cf.  (C7.a)), 


(2.36)      \   =   \{^)   -^  ©o  ,  V  A  e  A  . 


By  the  weak  law  of  large  numbers  and  by  (C6.a) 
(2.37)     iL^Cxle.A)  -^>  E(2^0^,e;l,A) 


=  E-  1)  [^iogf(x,e,A)]  , 


V  A  e  A  . 
Application  of  Theorem  2.J>   now  yields 


so  that 


V^iliili/2  p    rilViLli/2 


,.,8)       p2^-^i'-Mi/2  2,.  r_^i:oii!-r 


Next  we  will  show  that 


f(x  0,,A)  p       ,,    , 

(2. 39)         -^^;^r^ — >  0'   Va^I 

f(x|^l»l) 


Let 
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(2.40)     u  ^  -.::^ =  T[ 


f(xl0^,A)    n    f(x^|0^,A) 


/^     ) 


"^        f(x|0-L,l)    i=l   f(x^|e^,l) 


and  let 


(2.41)   v^  =  i  leg  u^ 


By  the  weak  law  of  large  numbers  and  by  (C5.a) 
i2A2)  i^  log  f(x.,e^,A)  ^>  E(e^;l,A) 

=  E,Q      -^Jlog  f(X|e   A)],   V  A  e  A  . 
Now  from  (2.36)  and  Theorem  2.3  we  obtain 
(2.43)   iji:  log  f(x^|e^,A)  ^>  E(g  ^i)[log  f(X|0^,A)] 

V  A  e  A  , 
so  that 

f{x,eQ,A)- 


)  . 


r    f{X,e  A)-, 
(2.44)     v^  -^>  E(     )[log  7p^;e^J=  E^(v)  . 


Now,  following  Hartigan  (I965)  we  have,  by  the  convexity  of 
the  exponential  function, 

(2.45)  exp[E-L(v)]  <  E-^[exp  v]  -  1, 
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since 

00    f(x,0    ,A) 


E^[exp  V]  =J_    rnrr^^  f(x,e^,i)dx  =  i. 


The  equality  holds  only  If  v  =  c  with  probability  1.   But 

by  assumption  f(x,0^,A)  differs  from  f(x,0^,l)  with  probability 

>  0,    so  that  V  Is  not  constant  and 


(2.46)  exp[E-L(v)]  <  1 

or 


o- 


that  Is 


so  that 


V  -^  E,  (v)  =  -  k,    k  >  0, 


P 
nv  — >  -  OD 


and 

(2.47)  u^  =  e  ^  X 


Repeated  application  of  Theorem  2.4  now  completes  the  proof 
of  the  theorem:   Let 

(2.48)   e*  =  d^[e^  +   Op(n-^)]  +  XZ  \^\  +   Op(^~"^)] 

A=2 
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where  d,  ,  A  e  a>  a^e  the  coefficients  shown  In  (2.35)  and 
A 


* 
(d^  Is  the  approximate  form  of  the  coefficient  g  defined  in 
A  '^ 

(2.10).)   From  (2.38)  and  (2.39)  we  see  that 


(2.49)  d-L  =  d^(x)  — >  1 

and 

d^  =  d,  (x)  ^>  0  ,     A  -  2,...,  m. 

A      A  — 


By  assumption  (C7.a) 


Thus 


and 


so  that 


^^  A>z,  .N(e^,  ,1(^1,,))  • 


i/^  A>  z,  .  N(e^,  Hwjry)  ' 


'A  ^  ° ' 


Op(n-l)   -^>  0    , 


(2. 50)  el  ^>  Z^  ^  N(e^,    l/nl(0^;l)) 

as   claimed. 
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Chapter  3 

THE  SYSTEMATIC  GENERALIZED  PITMAN  ESTIMATOR 

3.1.   Introduction;   Systematic  Estimation 

Mosteller  (1946)  proposed  a  method  of  estimation  for 
large  samples  based  on  only  a  subset  of  the  observations. 
It  involves  replacing  the  observations  x-,,...,  x  by  the 
ordered  set  of  their  values. 


^(1)  ^  ^(2)  ^  •••  "   ""(n) 


and  then  for  a  fixed  set  of  k  Integers  n^,  1  =  1, — ,  kj 
selecting  the  set  of  sample  quantlles  x,  n,.--,  x,  y 
such  that 


^(n^)  "^^(n^)  ^    '"    ^^(n^)  ' 


The  sample  quantlles  are  determined  by  the  relation 

nA.      if  nA.  is  an  Integer 
(3.1)       n  =  ^ 

[nA. ]+l   otherwise 

where  0  <  A.  <  Ag  <  • • •  <  A^  <  1  is  a  given  set  of  k  real 
numbers . 

Although  Mosteller  defines  any  function  of  the  ordered 
observations  as  a  "systematic"  statistic,  we  will  restrict 
the  latter  term  to  the  method  actually  Introduced  by  him. 
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I.e.,  the  use  of  only  a  selected  set  of  sample  quantiles. 

Mosteller's  proposal  was  motivated  by  the  coming  into 
use  of  punched-card  equipment  which  could  provide  an  economical 
way  of  sorting  (ordering)  large  masses  of  data.   Tedious 
further  computations  would  then  be  restricted  to  the  small 
selected  subset. 

A  significant  contribution  in  this  area  was  his  proof 

that  under  mild  regularity  conditions  the  k  sample  quantiles 

X/   ,,...,  X,   X  have  as  limiting  distribution  the  k-variate 

V^i]^)       ^  ^k 
normal,  with  mean  and  covariance  matrix  of  known  form. 

(This  result  had  been  proved  earlier  for  k  =  1,  2  by  K. 

Pearson  and  Smirnoff;  for  references  see  Mosteller  (1946).) 

As  a  first  example  Mosteller  suggested  using  the  mean 

of  the  selected  subsample  as  an  estimator  of  location  and  he 

computed  the  relative  efficiency  of  this  estimator  based  on 

different  sets  {  A^,  i  =   1,...,  k},  k  <_  10,  when  the  underlying 

distribution  was  assumed  normal.   In  each  case  the  variance 

of  the  estimator  was  computed  from  the  covariance  matrix  of 

the  asymptotic  k-variate  normal  distribution  and  compared  with 

the  variance  of  the  full-sample  mean.   For  all  spacings  {  A ^^ } 

considered,  the  relative  efficiency  exceeded  -9  already  for 

k  =  5. 

Ogawa  (1951),  in  an  extensive  study  of  this  approach, 
applied  the  generalized  Gauss-Markov  theorem  to  the  covariance 
matrix  of  the  limiting  multivariate  normal  distribution, 
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obtaining  so-called  best  linear  unbiased  estimators  (BLUE's) 
for  this  k-varlate  model,  pre-datlng  the  well-known  paper 
of  Lloyd  (1952).   He  also  studied  the  problem  of  optimum 
spacing  of  the  sample  quantlles  and  showed  (as  was  to  be 
expected)  that  the  BLUE's  based  on  these  quantlles  provided 
relative  efficiencies  for  the  normal  distribution  even  more 
favorable  than  those  obtained  by  Mosteller. 

Extensions  of  Ogawa»s  work  were  given  by  Tlschendorf 
(1955) •   Ogawa  also  contributed  several  expository  chapters 
to  Sarhan  and  Greenberg  (I962).  A   comprehensive  summary  of 
the  whole  problem  area,  with  additional  new  results,  is  given 
in  the  treatise  by  Sarndal  (I962). 

These  references,  all  pertaining  to  linear  estimators 
(not  taking  into  consideration  here  applications  to  the 
theory  of  testing),  will  be  discussed  further  in  subsequent 
chapters. 

Although  data  processing  has  come  a  long  way  since  the 
early  marvel  of  sorting  machines,  the  method  of  using  sample 
quantlles  is  by  no  means  obsolete.   More  sophisticated 
methods  of  constructing  estimators  can  give  rise  to  elaborate 
problems  of  numerical  analysis  even  when  k,  the  ni;imber  of 
sample  quantlles  used,  is  quite  small  relative  to  the 
sample  size  n. 

In  addition,  an  important  application  has  been  found  in  the 
space  program.    Given  a  large  set  of  univariate  measurements 
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made  by  a  satellite.  It  is  sometimes  economical  to  have  only 
a  subset  sent  back  to  earth,  unless  all  computations  are 
performed  aboard  the  spacecraft.   It  has  been  shown  that  it 
is  cheaper  to  Include  the  necessary  equipment  for  ordering 
the  observations  and  selecting  the  subsample  than  to  provide 
a  more  elaborate  computing  system.   For  a  full  discussion 
of  this  problem,  see,  e.g.,  Eisenberger  and  Posner  (I965),  and 
references  cited  therein. 

In  view  of  the  above  considerations,  and  the  fact  that 
low  values  of  k  already  yielded  very  high  relative  efficiencies 
in  the  cases  studied,  the  problem  of  "systematic"  estimation 
seems  to  warrant  further  investigation,  particularly  in 
connection  with  problems  of  efficiency-robust  estimation, 
to  which  we  turn  next. 
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3.2.   The  Systematic  Generalized  Pitman  Estimator:   Exact  Version 

Given  n  Identically  distributed  random  variables 
X-,,...,  X  with  pdf  f(x  -  e,A)  satisfying  conditions 
(C1)-(C7)  for  A  =  1,...,    m,    and  k  real  numbers 
A.,  0  <  A,  <  A^  <  ...  <  A^  <  1,  let  X^^^^,...,  X^^^^  be 
a  subset  of  the  variables,  with  n.  as  defined  In  (^-l). 
Assume  that  the  spaclngs  are  symmetric.  I.e., 

(3.2)  \  =  1  -  \-n-i  '   1  =  i"--'  ^  • 

In  what  follows,  let 

and  denote  by  w(x^^^|0,A)  the  joint  pdf  of  the  k  sample 
quantlles  under  shape  A. 

Then  the  generalized  Pitman  estimator  based  on  this 
model  may  be  written 

m  ,    oo  (^) 


(5.3)  0p(k,n) 


.^1      ^^-oo 

m  ,    oo  /• ,  V 

—  g-   /        v^{x}^>\e,}.)ae 

.=1      '^^-oo 
and,    from   (4.2)    of   Section  4.1, 


G'    '     '  m 

A^ 
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m  -  00        k+1  n   -n        -1 

G^    '     '  m  oo  k+1  ~~~       n~^ IT — 

IZsa/  TT  [f(x,     .-e|A)-F(x,        .-e|A)]  ^    ^-i 

A=l      ^^-00  1=1  ^^1^  ^"^i-l^ 


IT     f(x,       .-0|A)d0 
.    1=1  ^^1^ 

T ^ ■ 

TT    f(x/„  ^-0|A)de 
1=1       ^"^1^ 


where 


0   ,        n^^^   =  n  +  1 


X,  ,      =    -     OD      ,  X, 


and 


(x|a)    =    /  f{t|A)dt    . 

J  —m 


All  properties  established  in  Theorem  2.2  for  the  generalized 
Pitman  estimator  apply  within  the  context  of  this  model: 

(a)  Unblasedness  follows  from  the  additional  assumption 
that  the  spaclngs  are  symmetric. 

(b)  The  variance  of  0  (k,n)  is  independent  of  6   because 
this  estimator,  too,  is  translation  invariant,  i.e., 

(3.4)     ^liL^^^    +   £|k,n)  =  e^(x^^^|k,n)  +  c  , 
where 

X^^)  +C.=  (X(^_^)  +C,...,  X(^^^  +c) 

and  c  is  an  arbitrary  constant.   We  have,  from  (5-3) 
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''a<i""«|.,n, 


m      r   oo   ^"T-L  r  -|  n .  -n . 

^^^J.J   ill    h'^(n,)-"^--'|A)-F(X(„^_^)-(e-o)|A)J  ^  - 

m      ,--  00  k+1  r  n  n .  -n .  I 


TT    f(x,      -(0-c)|A) 
.  1=1   ^"^1^^ 


de 


TT  f(x,  .-(e-c)|A)d0 
1=1   ^^1^ 


Performing  the  change  of  variable 
e'  =  0  -  c 

Immediately  yields  {3'^)' 

The  method  of  proof  of  part  (c)  of  Theorem  2.2  Is  also 
applicable,  since  Stein's  method  of  proving  the  admissibility 
of  the  Pitman  estimator.  Theorem  2.1,  can  similarly  be  applied 
In  this  context:   Let 


^(n.)  =  ^1  '    i  =  ^"•"    ^ 


and 


/    xw(x,x+y^,.. .,  x+y^_^)dx 

(3.5)       g(y) 


/    w(x,x+y^,.  ..,  x+y^_-|_)dx 
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Then  the  transformation  (2.5)  yields,  as  the  conditional 
density  of  X  given  Y, 


w(x+f(y),  x+f(y)+y   .  ..,  x+f(y)+y,  ,) 

(3.6)     P(x,y)  ^ 1 ii=2_ 

r  00 

/    w(x,  x+y^,...,  x+y^_-|^)dx 


and  the  proof  of  Theorem  2.1  proceeds  as  before. 
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3. 3*   The  Systematic  Generalized  Pitman  Estimator: 
Approjxlmate  Version 

Although  the  estimator  0„(k,n),  based  on  k  sample 
quantlles.  Inherits  all  the  desirable  properties  of  the 
full-sample  Pitman  estimator  within  the  context  of  Its  own 
model.  It  Is  evidently  also  extremely  difficult  to  compute. 
However,  the  fact  that  the  sample  quantlles  have  a  limiting 
multivariate  normal  distribution  suggests  the  possibility 
of  defining  a  corresponding  estimator  based  on  this  asymptotic 
model.   One  Immediate  result  Is  ease  of  computation. 

This  new  estimator,  denoted  0^(k),  will  be  Introduced 

la 

In  this  section,  and  Its  properties  corresponding  to  Theorem  2.2 
established. 

But  how  good  Is  an  estimator  based  on  an  approximate 
model?   We  will  find.  In  Section  5.4,  that  not  only  Is  S^{K) 
BAN,  as  n  — >  oo ,  In  the  class  of  estimators  based  on  the 
k-varlate  normal  model,  but  also  that  with  Increasing  k  it 
achieves  full  asymptotic  efficiency  in  the  class  of  all 
estimators,  uniformly  over  all  shapes  A. 

We  state  here,  for  ready  reference,  Hosteller's  theorem 
concerning  the  asymptotic  distribution  of  the  k  order  statistics, 

THEOREM  3.1.   (e.g.  Sarhan  and  Greenberg  (I962),  p.  I7). 

Let  X-., ,  X  be  n  Independent,  identically  distributed 

random  variables  with  pdf  f(x|0).   For  k  given  real  numbers 
for  which 


44 


0  =\<\<\<  ...   <\<  \^^  -  1 


let   the   A. -quant lie   of  the   distribution  be  x.,    that   Is 

(5.7)  /  f(tl0)dt   =  A      ,        1  =  1,...,   k   . 

<J  -on 


Assijme  that  f  (x)  ls_  dlfferentlable  in  the  neighborhood  of 
x=x.,  i=l,...,  k,  and  let 

f^  =  f(x^)  ^  0       for   1  -  1,...,  k  . 

Then  the  joint  distribution  of  the  k  sample  quantiles 

where 

fnA.      if  nA.  is  an  integer 
[nA.]+l   otherwise 

tends  to  a_  k-dimensional  normal  distribution  with  means 
X-., . . . ,   X,  and  with  covariance  matrix 

.       .   A.(l-A.) 

K  r  "  ^  1^  f.f.  ^    ]  '    1  1  J  ,   r  symmetric 

as  n  — >  00  . 

When  f  depends  only  on  a  location  parameter  9,    i.e., 
is  of  the  form  f(x  -  ©),  let  the  A.-quantile  of  the  standardized 
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(5.8) 


/  ^  f(y)dy  -  A 


SO  that 


(3.9) 


E[X(^_)]=  ^1  =  u.  + 


(For  a  derivation  of  (3-9),  and  other  related  results,  see 
Section  4.1.)   Then  the  pdf  of  the  limiting  distribution 
of  the  k  sample  quantlles  X,   ,,...,  X,       x  Is 


(3.10)   h(x^^^|0) 


(n_)k/2  IP, -1/2  ^^p^_  n  (^(  k)  _e_^)  ,  p -l(^(k)  _,_^)3 


where 


£  =  (0, ..., 
u  =  (u-j^,  .  .  . 

in  =  det  r. 


k-vector . 


u  =  (u-j_,  .  .  .,  u^)  , 


\{l-\) 


A-j^d-Ag) 

^1^2 


A,(l-A^) 


A-^(l-A2) 


l^^il-'K^) 


A^d-A^) 


A,(l-A^) 


^2^k 


vi-\) 


^l^k 
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and  f^  -  f(u^),  1  =  1,"',   k;  that  Is, 


(3.11)   h(x^^^l0) 


=  (^)^/'  f,...VA,(A2-A,)...(A^-A^_,)(l-A^)]-l 


/2 


•  exp 


2  Lfrr  (>^i+1-\HA.-A._-l)   1^  (n.) 


2^|^(-(n^)-^-Ui)(X(,^)-e-u,_,) 


In  what  follows  we  assume  that  the  spacings  satisfy  (5-2). 
Then  for  k  <  n  and  P  non-singular,  we  define  the 
approximate  systematic  Pitman  estimator 


(3.12)  e    (k)  = 


^  -oo 

[         h(x^^h0)d0 
"J  -nn 


[°°    e   exp{-  I  (x^^^-u-£)'S(x^^^-u-0)de 
_  J  -oo 

r°°  exp{-  ^  (x^^^-u-£)'S(x^^^-u-£)d0 
^  -oo 

where  S  =  [s.  .]  =  V  ~'^ ,    all  other  terms  being  defined  in  (3.10) 
For  simplicity  we  will  write  x,   x  =  x^.   Writing  out  the 
denominator,  we  have 
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k   k 


(j.ij)  J_^   exp(-|^^(.,-u.-e)(xj-u.-e)s,j)de 


k   k 


J_^    exp(-§g^^[U,-u,)(x.-u.)s 


ij 


)(x.-u.+x.-u.)s,j  +  0^s,.]]d0 


Completing  the  square,  we  obtain  for  (3.I3) 
(3.W)   e.p|-  ^  [r^^  (x,-u,)(.j-u.).,. 


n  k   k 


>    >    (X. -u. +x  .-u  .  )s  .  . 

— k — k 

1=1  j=i    ^^ 


12 


k   k 

5  (x.-u.+x  .-u.)s. 


J^ 


1  1   J   J 


k F 


i=l  J  = 


-1^ 


=  expression  up  to  Integral  , 
in  (3.1^) 


2'Tr 


k        k 


I^  J 


br^lJ 


Completely  analogously  we   obtain  for  the  numerator   of    (3.12) 
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(3.15)     /        e  exp{-  §ZZEI  (X  -u  -e)(x  -u  -e)s     }de 


1^  J^ 

k       k 


> 5:       T~  (x^-u,+x  ■-u.)s^  , 

r         K       1    /__27r____   .   feri^       1      1      J      J      Ij 
as   above    •  /  ■ ■- ^ — ^ = 


k   k  k   k 


so  that  (3.12)  reduces  to 

k   k 


(3.16)       0*(k)  =  "~"  ''"\ ^ 


SS 


<^l-V^J-"j'=13 


'f=rf^"iJ 


It  is  easy  to  verify  that  also 

(3.17)  0*(k)  =  0(k)  , 

the  maximum  likelihood  estimator,  and 

l'Sx(^)' 

(3.18)  e*(k)  =  e(k)  = , 

I'Sl 

the  least  squares  estimator  of  9   corresponding  to  the  given 
k-varlate  normal  distribution.   (Here  1  =  (1,...,  1)', 
a  k-vector,  and  we  assume  that  (Cl)  holds.) 

Let  us  assume  now  that  we  are  given  a  family  of  m  shapes 
f(x  -  0,A),  A  =  1, . . .,  m,  all  satisfying  (Cl)  and  the  conditions 
of  Theorem  3.1.   Supplying  the  Index  A  where  necessary,  we 
have  Immediately  the  corresponding  systematic  generalized 
Pitman  estimator  9^(k);  that  Is, 
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(5.19)    QnW 

TsTIT/q  I~    nJ\    ~I  ~~ri7^      r~I  ^ 


^^  e,(k) 


j=i  -^  ^  ^j  ^    -     -    -J       J  -       -    -J 


where 


sx  -  r,- 


with  n  the  covarlance  matrix  of  the  k-variate  normal 
distribution  for  shape  A, 


MLE  for  shape  A  , 


£^  =  (e^,...,  9^),    a  k-vector, 

u^  =  E^[X^^^]  -  £  , 
k   k 


and,  as  always,  >   g  =  1,  0  <  g  <  1,V  A- 

All  comments  In  Section  3.2  concerning  properties  of 
ep(k,n)  apply  to  0p(k)  within  the  context  of  this  model. 
In  particular,  0p(k)  Is  admissible  (with  respect  to  Q  =  0  x  a  j 
in  the  class  of  all  estimators  of  0)  on  the  hypothetical 
ass\;miptlon  that  the  respective  asymptotic  distributions  of 
the  k  selected  sample  quantlles  h(x    |e,A),  A  e  A,  represent 
the  true  model.   As  mentioned  earlier,  the  sense  in  which 
this  hypothetical  assumption  is  also  a  pertinent  approximation 
will  be  examined  in  the  next  section. 
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3.4.   Asymptotic  Properties  of  the  Approximate  Version 

As  before,  we  will  assume  without  loss  of  generality 
that  A  -  1  Is  the  true  value  of  the  shape  parameter.   Then 
(3.19)  may  be  rewritten  as 


(5.20)   e*{k)  ^ 


0^^^  =  .      -    gjfl^jl       u'vMi/2 


1  + 


^     m  g, 


m     g  _.  ^  fS"!  (l'S^l)\  -L/2 


A=2  ;l  +  ^—  ^1/  :_j 


f^iT^^rr-n-^V^ 


exp(-g.(x(^^-u^-e^)'s^(x^.^^V-e^)} 

exp{-^(x^^^-u^-£3_)'S-^(x^^^-u^-£^)} 


exp 
exp 


[-|(^('^'-uJ-g.)-S,(x"''-uJ-|T)7 


eAK 


e,(k) 


We  will  first  consider  the  limiting  distribution  of 
^*(k)  as  n  — >  00  but  k  remains  fixed. 
We  begin  with 
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LEMMA  5.1.   For  A  ^  1,  u^  ;^  u-*" ,  S   ^^  S-^, 


Proof:   We  have  x''''  =  (X,   ),...,  Xj^^  j) 
(3.22)  Ex['=(n,)l  ="i  ^^  • 


Now 

(3-23)  V3^[^(n.  ) ]  =  ,  "  Ai  t,2  — ^  ° 


A,(l-A,) 


^(■"i^   'n[f(u^|A)] 


=>^(n.)  -"E^[^(n.)J  ,   A  e  A 


so  that,  in  our  case. 


^(n,)  -'  "i  -^ 


From  (3.18)  we  have 


I'S  y^^^' 
^       ^A— 

^A  =   I'S  1     ' 
A 

SO  that 

I'S^ns.l 

(3.24)       Var[0.,  |l]  -  '        r<   — >  0  as   n  — >  00  . 

nd'S,!)^ 

Thus  0,  too  Is  consistent  and  goes  in  probability  to  &  with 
A 

increasing  n,  since 
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(3.25)     E^-T^^j=p,^+e  =  e  . 


That  l'S-,u  =  0  was  shown  by  Lloyd  (cf.  p.  24  of  Sarhan 
and  Greenberg  (I962)).   Thus  we  have 


x^^)  -  u^  -£i  ->o 


(3.26)     and 


X^^^  _  u"  -  _0^  -^>  u   -  u' 


Write  (3.21)  as 

(3.27)  u^(A)  =  exp{-  I  [(x(^^-u^-£^)'S^(x^^^-u^-£^) 

,  ( k)  1    2    \ 1  a    /     (k)   1  o  ^ 1 
-  (x^   -Ji  -£i)  S-|^(x^   -]i~£l^J} 

=  exp{-  I  v^(A)  }  . 
But  by  (3.26) 

(3.28)  v^(A)  -^>  (u^  -  u^)'S^(u^  -  u^)  >  0  , 


so  that  we  have  Immediately 

(3.29)  u^(A)  — >  0  . 

This  brings  us  to 

THEOREM  3.2.   ^Q ( 1^ )  is_  distributed  asyrnptotically  as 
N(0,    J  „  ^ ) ,  where,  as  before,  A  =  1  1e_  the  true  value  of 
the  shape  parameter. 
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Proof:   Write  (3-20)  as 

(3.30)     e^ik)   =   d^0^(k)  +  2Z  d^^^A^i^) 

A  =2 


■X-  ^  m        /\ 

(3.31)  /^(^^(k)  -  0)  =  d^^(0^(k)  -  0)  +  21:  d^/n(0^  -  0) 

A  =2 


From  the  lemma  we  have 


(3.32)     and 


d^  .  d,(x(^h  ^>  1 


d-^  =  d^(x^^M  ^->  0  ,    A  -  2,...,  m. 


But,  V  A, 


^  V   n(l'S.l)^/     ^     ^ 


(I'S^l)' 


so  that  application  of  Theorem  2.4  yields 


^(0j(k)  -  0)  _>n(o,  p^^J 


as  claimed. 

These  results  show  that  as  the  sample  size  n  Increases, 
0p(k)  Is  asymptotically  efficient  among  all  estimators 
based  on  the  k-varlate  normal  model,  uniformly  over  all 
shapes  A. 
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But,  letting  k  Increase  also,  we  have  an  even  stronger 
result . 

THEOREM  5.5.   Assume  that  f (x  -  e|A)  i£  everywhere 
differentlable  with  respect  to  u  =  x  -  6,  for_every  A  e  A  , 
a_rid  th_at  0  <  I(e;A)  <  00.   Then  e^(k)  i£  an  e-efficient 
estimator  uniformly  over  the  respective  shapes,  1_^.  , 

V  e  >  0  H  k^  3  k  >  k^  =:,> 

K^(k) 


(5.33)         -T[aT 

where 


>  1  -  e  ,     A  e  7\  , 


K,(R) 


I'V 


and 


1(A)  =  I(0;A)  . 

Proof:   We  have  seen  that  the  asymptotic  variance 
Var[0*(k)]  is  of  the  form  l/nl'S^l,  where  A  =  1  is  the  true 
value  of  the  shape  parameter.   We  are  dealing  here  with  a 
double  limiting  process,  where  first  n  — >  00,  then  k  — >  00  , 
but  still  k/n  — >  0.   (This  last  condition  is  already 
implicit  in  the  assumption  of  the  validity  of  the  k-variate 
asymptotic  distribution  for  every  k. ) 

Thus  next,  for  very  large  n,  we  examine  Yav[e^{K)] 
for  increasing  k.   In  explicit  form  we  have 
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k    (f.  . ,-,  -  f,  .)^ 

(3.54)        K^(k)  =  YZ         '       ^^^- 


where 


and 


\   '   0'  ^krt  =  1'  ^A,o  =  fx,k+l  =  ° 


f,  ,  =f(uJ|X)  . 


It  has  been  shown  by  Tischendorf  (1955)  that  K^(k)  Is  a 
nondecreaslng  function  of  k  (as  additional  quantlles  are 
added  to  a  fixed  spacing  A^,  1  =  1,.-.,  k)  and  that  by  suitable 
choice  of  additional  qu&ntlles  It  can  be  brought  arbitrarily 
close  to  Its  upper  bound  E^[(B/Su)  log  f(u|A)]  ,  I.e., 


(5-35)     \l^   log  f(u|A)]2  -  K^(k)  <  e' 


Now 

(3.36)    E^[|^  log  f(u|A)]2  =E^[^  log  f(x  -  e\^)f    ^   1(A) 

and  the  efficiency  in  question  is 

1        .^a(^)  ^ 

nI(A)Var[e*(k)]    ~^^  ' 
Prom  (3.35)  we  have 


1(A)  -  K^(k)  <  e'  , 
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so  that,  since  0  <  1(A)  <  oo  ,  V  A  e  A  , 

^  -  -T[aT   iTaT  -  ^  ' 

and 

IC(k) 
(3.37)         -:^>1  -  s  . 

Letting  k  =  max   ( k, ) ,  we  obtain  the  result  that  (3.35) 
°   AeA    ^ 

and  hence  (3.37)  hold  V  k  _>  k  ,  as  was  to  be  shown. 

Finally,  for  any  e  >  0,  we  obtain  e'/l(A)  ^  e   for  each  A  by 

determining  e'  above  by  e '  =  e  mln   (1(A)). 

A  e  A 
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3.5'   Case  When  Scale  Parameter  Is  Unknown 

Unlike  the  linear  estimators  to  be  treated  In  subsequent 
chapters,  in  general  the  Pitman  estimators  here  discussed 
are  not  scale  Invariant.   We  have  been  considering  families 
of  shapes  of  the  form  f(x  -  e|A),  a  special  case  of 

(3.38)  -f  f(^|A)  , 

In  which  cr.  Is  known  V  A  e  j^  . 

In  the  case  of  greater  practical  Interest  where  g,  , 
as  well  as  A,  is  unknown,  the  situation  is  more  complicated. 
A  systematic  generalized  Pitman  estimator  can  now  be  defined 
by 


m       X   ,x  ^         ^u-> 


Bj(Ti|jif  ^-P(-J?(i'''-SjH^-V'Sj'^'''-SjH^'-V)  ' 


where  now 


^A^^Cn.)]  =-^A 


and  CT^  is  an  estimator  of  cr,  .  • 

A  A 

Here  we  need  to  show,  in  order  to  establish  results 
like  those  of  Theorem  5.2,  that 
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(3.40)      exp{~  -^  ii^^^^-o-^}^-\)'\i^^^^-a^}l'-^^) 


.(^(k)_^y_e^),s^(^(k)_;_^^i_^_^)3  3  X>  0 


V  A  7^  1,    where   A   =  1   is   the   true   value   of   the   shape   parameter. 
Clearly  the    substitution    for   each  a,    of   an  estimator  which  is 

A 

consistent  when  A  is  the  true  value  yields  the  desired  result. 

i\n  attractive  choice  here  is  the  best  linear  unbiased 
estimator  of  a-^    introduced  by  Ogawa  (I95I)  and  based  on  the 
same  set  of  sample  quantiles  V  A. 

Using  matrix  notation,  in  order  to  indicate  the  analogy 
with  (3.18),  we  obtain 

u^S  x^^^' 

(3.41)  ;^(k)  =  ?^{k)  =  v^=5^ —  • 


u  S,  u 


Assuming  again  that  A  =  1  is  the  true  value  of  the  shape 
parameter,  we  have 

u^S^l 

(3.42)       E  [S^  (k)]  =  an  +  0  -r-H 

=  ai   , 
by  comments  following  (3«25).   Also, 


(3.43)        Var[ 
so  that 


^.(k)|l]  =  =_J_.Oz_  -^ 


n(u^S^u"') 


a^(k)  ^>  a^ 


59 


;5. 6.   Discussion 

The  advantages  of  using  ^^(k)  to  estimate  location  are 
attractive. 

Unlike  the  other  Pitman-type  estimators  discussed  earlier ^ 
it  is  extremely  easy  to  calculate  on  a  computer.   Knowledge 
of  the  exact  covariance  matrix  of  the  order  statistics 
involved  is  not  necessary,  in  contrast  to  some  of  the  other 
cases  discussed  in  this  paper.   0p(k)  in  addition  shares 
the  optimum  properties  of  other  Pitman  estimators;  it  is 
admissible  within  the  context  of  its  own  model,  and 
approaches  full  asymptotic  efficiency  uniformly  over  all 
shapes  A  e  A  even  in  the  context  of  the  exact  model  and 
full  samples. 

It  is  this  latter  property  that  really  distinguishes 
it  from  its  linear  competitors,  to  be  discussed  in  the 
following  chapters. 

However,  to  evaluate  its  actual  performance,  numerical 
work  is  necessary.   For  example,  how  fast  are  the  asymptotic 
values  approached?   For  small  n,  how  does  this  k-varlate 
model  compare  with  the  full-sample  Pitman  estimator?   How 
does  each  of  these  compare  with  corresponding  linear 
estimators?   Because  of  the  analytic  Intractability  of  some 
of  the  small-sample  variances  involved,  Monte  Carlo  studies 
are  being  planned  for  different  values  of  k  and  {A.  }  ,  and 
selected  families  of  underlying  distributions. 

The  problem  of  the  choice  of  k  and  {  A.  }  will  be  treated 
in  Chapter  6. 
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Chapter  4 

SYSTEMATIC  LINEAR  ESTIMATORS  COMPUTED  FROM 
COVARIANCE  MATRIX 

4.1.   Introduction 

Using  the  covarlance  matrix  of  the  order  statistics  for 
a  sample  of  size  n,  Lloyd  (I952)  obtained,  by  means  of  the 
generalized  Gauss-Markov  theorem,  best  linear  unbiased 
estimators  (BLUE's)  for  the  location  parameter  d.      His 
method  was  generalized  to  the  problem  of  efficiency-robust 
estimation  by  Laska  (I962),  who  derived  the  corresponding 
admissible  linear  unbiased  estimators  (ALUE's). 

Lloyd's  method  may  be  applied  to  the  exact  covarlance 
matrix  of  a  given  set  of  sample  quantlles,  yielding  best 
systematic  linear  unbiased  estimators  (BSLUE's)  for  given  k 
and  spacing  {A.}-   These  estimators  can  then  be  generalized 
to  provide  admissible  systematic  linear  unbiased  estimators 
(ASLUE's)  of  Q.      This  work  has  been  carried  out  in  Section  4.2. 

If  we  apply  the  Gauss-Markov  theorem  to  the  covarlance 
matrix  of  the  asymptotic  k-variate  normal  distribution 
of  the  sample  quantlles,  we  obtain  the  estimators  studied 
by  Ogawa  (I95I);  we  shall  call  these  asymptotically  best 
systematic  linear  unbiased  estimators  (ABSLUE's).   These 
estimators  are  defined  in  Section  4.3;  we  then  prove,  using 
results  of  Blom  (I958),  that  the  ABSLUE's  are  asymptotically 
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equivalent  to  the  BSLUE's  to  within  0  (n"^).   This  Is  a 
very  useful  result,  since  the  exact  covariances  are 
available  only  In  a  very  restricted  range  of  cases.   The 
fact  that  these  ABSLUE's  are  also  asymptotically  efficient, 
as  k  — >  oo ,  In  the  class  of  all  estimators  of  6   follows 
from  the  work  of  Tlschendorf  (1955)' 

In  the  last  part  of  Section  4.3  we  generalize  the 
ABSLUE's  to  the  problem  of  efficiency-robust  estimation  and 
derive  the  corresponding  asymptotically  admissible  systematic 
linear  unbiased  estimators  (AASLUE' s ) ;  the  asymptotic 
equivalence  to  the  ASLUE's  holds  analogously,  but  the 
asymptotic  efficiency  does  not.   That  Is  why  we  needed  in 
preceding  chapters  to  consider  non-linear  estimators. 

For  finite  sample  sizes  it  would  be  desirable  to  make 
independent  numerical  evaluations  of  the  properties  of  these 
estimators,  in  comparison  with  competing  estimators 
(including  those  discussed  in  Chapters  2,  3,  and  5)  and 
asymptotic  properties. 

As  before,  we  assume  throughout  the  chapter  that  a 
particular  spacing  {A.}  is  given  for  each  k.   The  problem 
of  optimum  spacing  will  be  treated  in  Chapter  6. 

For  convenient  reference  we  state  here  some  basic 
definitions  and  results.   These  may  be  found  e.g.  in 
Sarhan  and  Greenberg  (I962)  or  Blom  (I958). 
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distributed  random  variables  with  cdf  P(x  -6),    and 
corresponding  pdf  f(x  -  d) ,    where  E[X. ]  =  6  V  1,  and 
|e|  <  00  . 

(a )   Population  quantlle 

For  k  given  real  numbers  A.,  1  =  1,...,  k, 
3  0<A-,<...<A,  <1,  let  the  A. -quantlle  of  the 
population  be  x.,  as  already  stated  in  (5«7)j  i.e.. 


(4.1)         /   ^   f(t  -  e)dt  =  A.  ,      1  =  1,...,  k. 

^  -GO 


(b)   Sample  quantlle 

Then  we  may  define  the  k  sample  quantiles 
X,   ^  <  X/   ^  <  . . .  <  X,  )}    ^    subset  of  the  order  statistic: 


X,-,x  <    ...  "^  X/  s,    such  that 


nA.,     if  (nA.)  is  an  integer 


[nA. ]+l   otherwise,    1  =  1,...,  k. 


(c )   Exact  distribution  of  sample  quantiles 

The  joint  pdf  of  the  k  sample  quantiles  (for  6  ==  0) 
is  given  by 
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(4.2)  w(x^^^) 


k+1 

1=1   ^   ^  -^ 


k+1  n.-n.  ,-1  k 


if  X,   ,<...<  X/   % 


0      otherwise  , 
where 

%  =  ^   '      "^k+l  -  n  +  1  ^   ^ 


(%)  =""'   '(-k+l) 


(d)   Exact  moments  of  sample  quantlles 

(4.3)  E[X(^_)|e  =  0] 

,        r  oo    n .  -1  n-n. 

'^1°  (n.-l)?in-n.)jj.^  g"    (e)[l-F(0]    "<SP(I) 

1      n.-l     n-n. 


(n.-l)!(n-n.)I  j^   G(u)u  ^   (1-u)    ^du 


G(u)6   (u;n)du  , 
o       "^1 


where  G(u)  =  P~  (u)  Is  assumed  to  be  uniquely  determined  and 
P   (u;n)  is  the  beta  function.   Second-order  moments  may  be 

computed  from  the  following: 
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(4.4)  E[X(^_)  X(^_)|e  =0] 


^ij  =  (n.-l)!(n'.-n.-l)!(n-n.)J  /     /    ^^^^   ^^^ 
•  [F(ti)  -  P(|)]''j  ""^   [1  -  F(ti)]''  ''jdF(|)dP(Ti) 


where 


--   I  G(u)G(v)B   ^  (u,v;n)dudv 

o<u<v<l 


'lj(u,v;n)  =  ^_,),^j!!:_^),^,.j),  u-^v-u)J--^l-v)--J 


1  <  J  • 
We  obtain,  for  the  variance, 
(4.5)   Var[X(^_^]  =j         [Giu)f   p^_(u;n)du  -  ^J      G(  u)p^_  (u;n)duj' 


These  Integrals  do  not  always  exist. 

(e )   Sufficient  conditions  for  existence  of  moments  of 

sample  quantlles 

Blom  (1958),  p.  43,  gives  two  sets  of  sufficient 
conditions;  we  will  state  only  the  stronger  one  (since  It 
Is  simpler  and  sufficient  for  our  purposes): 


|E[X^^    ^]|     <   00         VI      if      E|X|    <   00 
and 


(n,) 


'^f^(n.)    ^(n.)^'     ^   "^  ^     ^'^      ^^      ^^^    ^   °°* 
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(Of  course,  this  holds  In  our  case,  since  we  have  \9\    <  oo , 

CT  =  1). 

(f )  Asymptotic  distribution  of  sample  quantiles 

This  is  Hosteller's  theorem  which  was  stated  already 
in  full  detail  as  Theorem  3.1. 

(g)  Limiting  form  of  first  and  second-order  moments  of  full 
set  of  order  statistics 

These  results  are  a  slightly  modified  form  of  Blom's 
(1958),  p.  53ff.   Unlike  him,  we  assume  the  stronger  set 
of  sufficient  conditions  for  the  existence  of  the  exact 
moments. 

First  we  need  the  following  definitions. 

DEFINITION.   c -sequence.   The  double  sequence  {x.     } 

V'  V 

,    is   called  a   c-sequence   if 

(4.6)  lim  -l=c,  0<c<l. 

V  — >  00      V 

-1/2 
DEFINITION.   Less  than  n  ^    -convergent  c-sequence. 


-I/2 
{x.     }  is  a  less  than  n  '  -convergent  c-sequence  if 

^  M,  p  3  for  all  but  a  finite  number  of  v ' s 


(4.7)  I  ^-H  >'*';'''"'■'' 


Then  we  have. 
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LEMMA  if.l.   (Mean)   If 

(a)  E|x|  <  oo 

( b )  G ( u )  and  G '  ( u )  are  boionded  and  continuous,  and 
G"(u)  is  bounded  In  ^  =  [0,1],  with  the  possible 
exception  of  a   finite  number  of  points  ,2* 

( c )  X .  belongs  to  any  c -sequence  when  c  e  ^ ,  and  to 

-1/2 
a   less  than  n  ^  -convergent  c-sequence  when 


c  e 


^* 


then 


(^.8)  E[X.|0  =  0]  =  G(^)  +  R^  , 

where  |R.  |  ^  rr  ^  and  M  ls_  Independent  of  1  and  n. 

LEMMA  4.2.   (Second-order  moments)   If_ 

(a)  EX^  <  CO 

(b)  G(u),  G'(u)  and  G" (u)  are  bounded  and  continuous 
and  G"'  (u)  is  bounded  in  ^-  [0,1]  with  the 
possible  exception  of  a  finite  number  of  points  C*, 


(c)   X.  belongs  to  any  c-. -sequence  when  c^  e  C,    and  to 

-1/2 
a_  less  than  n  '^    -convergent  c-, -sequence  when 

C-,  e  ^*;   X.  belongs  to  a  c^-sequence  {c-^   <_  c^) 

^  Cp  e  ^,  and  to  a  It 

Cg-sequence  when  Cp  e  <^' 


-1/2  *■ 
when  Cp  e  ^,  and  to  a  less  than  n  '  -convergent 


then 


(4.9)    Cov(X^,Xj)  =  i  p^(l-p^.)G'(Pi)G'(Pj)  +  R^j  , 


1  <  J. 
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where 


Pi  =  HTT  '   Pj  ^  H+r 


and 

IR.  . 

■J 

with  M  Independent  of  1 , j ,  and  n. 


^,,<i^ 


(These  lemmas  will  be  used  also  in  Section  5.1.   For  convenience 
of  application  we  have  changed  the  factor  (n+2)~   in  (4.9) 
to  n~  ;  the  difference  is  of  course  absorbed  in  R.  ..) 
(h)   Application  of  (g)  to  k  sample  quantiles 

Modifying  the  above  results  to  the  case  of  k  sample 
quantiles  we  may  write,  since 

(4.10)  lim   lim   — i-  =  A.  ,    i  =  1,...,  k  , 
k_>oo  n->oo  ^^-^  ^ 

and  G(u)  is  continuous, 

(4.11)  E[X(^_)|0  =  0]  ==  G(Aj^)  +  0(n"^) 
and 

(4.12)  Co^fX(n.)'^(n.)^  =  i-  A^(l-Aj)G'(A^)G'(Aj)  +0(n"^)  . 

Since  G'(u)  =  l/f(x),  we  see  that  this  application 
of  Lemmas  4.1  and  4.2  provides  sufficient  conditions  for  the 
given  moments  to  converge  to  the  corresponding  moments  of  the 
limiting  distribution  (cf.  (3.10)). 
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In  (5.8)  we  defined  u.  as  the  A.-quantlle  of  the 
standardized  distribution, 

(4.13)  u^  =  F"-^(Xi)  =  G(A^)  . 

Indicating  a  non-standardized  distribution  by  under- 
lining, we  have 

(4.14)  /   ^  f(y)dy  =  X.  , 
^  -00 


50  that,  by  (4.11), 


(4.15) 


But  since 


=  X,-  +  0(i)  . 


1     'n' 


(4.16)  f(y)  =  f(t  -  6)    , 


we  obtain 


r^i 


(4.17)  /    f(t  -  0)dt  =  P(x^  -  9)    =  -K^ 
^  -00 

so  that 

(4.18)  G(X^)  =  x^  =  G(A^)  +  0  =  u^  +  0  , 

yielding,  by  (4.15), 

(4.19)  E[X(^  )]  =  G(A)  +  0  +  O(i-)  . 
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4.2.   Systematic  Linear  Estimators  Based  on  Exact 
Covarlance  Matrix 
In  this  section  we  generalize  to  the  case  of  systematic 
estimators  of  6   the  results  of  Lloyd  (1952)  and  Laska  (I962); 
(Cl)  Is  assiomed  to  hold. 

THEOREM  4.1.   (BSLUE's)   Let  X,   v'--'  ^(r,    \    ^^   ^    ^^t 
of  sample  quant lies  such  that 

(4.20)  ^f^(n.)^  =  v^  +  e  , 
and 

(4.21)  Oov[X,^^,,X,„^,]  =t,j  =v.j  -v,Vj  , 
where  v .  .,  v. ,  v  .  are  defined  by  (4.3)  and  (4.4).   Let 

T  -  [t^.]    ,     1,  j  -  1,...,  k  , 

such  that  T~  exists,  and  let 

1  =  (1,...,  1)'  ,   a  k-vector  . 

Then 

1,T-Vk), 

(4.22)  e(k,n)  =  . 

I'T'^l 

Is  a_  best  systematic  linear  unbiased  estimator  (BSLUE)  of  6. 

Proof:   The  theorem  follows  Immediately  from  application 
of  the  generalized  Gauss-Markov  theorem  to  T,  for  estimating 
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the  location  parameter  when  the  underlying  distribution  is 
symmetric,  as  shown  by  Lloyd  (I952).   Clearly  0(k,n)  has 
minimum  variance  among  linear  unbiased  estimators  based  on 
the  given  k  sample  quant lies,  the  model  now  being  considered. 

THEOREM  4.2.   (ASLUE's)   Assume  that  the  conditions  of 
Theorem  4 . 1  hold  for  each  of  a  family  of  m  shapes  A, 
A  =  1, . . . ,    m,  where  now 

(4.25)  E,[X,^^,1  =v^H-e 

and 

(4.24)  Cov^[X(^^),X^^_^]  =  [tj.]  =T^  . 

Further,  let 

m 

(4.25)  M  =  5^  g^T 


A  A 


where  G  =  {g,  ,  .  .  . ,  g^}  is_  a  given  set  of  real 


numbers 


1  and  0  <  g^  <  1  V  A.   From  (4.25)  it_  follows 
that  M  is  positive  definite  and  hence  non-singular.   Then 


XI  g^  -  1  and  .   ^^ 


(4.26)  ep(k,n)  =   ^ 


is  an  admissible  systematic  linear  unbiased  estimator  (ASLUE) 
of  e. 
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Proof;   The  method  of  proof  is  completely  analogous 
to  that  of  Laska  (I962)  for  the  case  of  a  full  sample. 
Admissibility  here  is  claimed  over  O  =  (Q  K  A  (with  variance 
as  the  risk  function),  in  the  class  of  linear  unbiased 
estimators  based  on  the  k  sample  quantiles. 

The  proof  involves  applying  the  Gauss-Markov  theorem 
to  the  mixture  model  (2.8);  this  yields  the  estimator 


(4.27)  e    {\,,n)    =   ^ , 

^         l.rp^l-L 

where  Tp  is  the  covariance  matrix  of  X^    under  the  mixture 


model.   Although  in  general 


m 


it  has  been  shown  by  Craig  (I967)  that  under  the  symmetry 
assumptions  on  f(x|0)  and  the  A.  (defined  by  (3.2)),  valid 
in  this  case. 


(4.28)  l'T~-^l  =   I'M'^l 


and 

(4.29)        I'T^y.^^'  =  l'M~V^^ 


so  that 


e^(k,n)  =  0Q(k,n) 
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This  has  great  computational  significance,  since  we  need 

consider  only  M.   (Cf.  also  the  discussion  In  Blrnba\;im  and 

Laska  (1967a).) 

Admissibility  Is  now  easy  to  show  by  the  usual  method: 
Assume  e_,(k,n)  Is  Inadmissible;  then  3   ^  linear 

estimator  0'(k,n)  such  that 

(4.30)  Var[e'  |a]  <  Var[eU]  ,   V  A  e  A  , 

with  Inequality  strict  for  at  least  one  A  e  a* 

As  linear  estimators,  both  6'  and  0   have  variances 
Independent  of  6;    In  particular, 

(4.31)  Var[e„|G]  =  ^-  . 

^      nl'M~^l 


Now 

m 
Var[e'|G]  =XZ^a^f^'|A]     by  (2.9) 
A=l 

Var[0|A]      by  (4-30) 


S 


A= 
=  Var[0|G]  by  (2.9), 

but  this  contradicts  the  assumption  that  0^  Is,  by  the 
Gauss-Markov  theorem,  the  unique  BLUE  of  9   under  the  mixture 
model  considered. 
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4. 5-   Systematic  Linear  Estimators  Based  on  Asymptotic 
Covar lance  Matrix 
We  now  define  Ogawa's  estimator  of  0  and  show  that  It  Is 
asymptotically  (as  n  — >  oo  )  equivalent  to  the  best 
systematic  linear  unbiased  estimator  y(k,n).   By  the  result 
of  Tlschendorf  described  In  the  proof  of  Theorem  3'3j  such 
estimators  also  approach  asymptotic  efficiency  as  we 
let  k  — >  oo  .   As  always  f(x  -  6)    Is  assi;imed  to  be 
symmetric . 


THEOREM  4.5.   (ABSLUE's)   Let  X,   ^,...,  X,   ^  be  a 

vri-]_/       ^  k  —  — 

set  of  sample  quantlles  with  asymptotic  moments 


(4.32)  ^[^(n.)^  =  u.  +  0 

and 

_L  „      A^(l-A  ) 
"°^f^(n.)'^(n.)^  "I   -  nf(u.)f(u.) 

as  defined  by  (4.19)  and  (4.12);  we  are  disregarding  the 


remainder  terms. 

Let  S  =  T"^  exist,  and  let  1  = 

=  (1,... 

,1)'. 

Then 
(4.33) 

l'Sx(^^ 
0(k)  -    -i7<.n 

Is  an  a sympt ot Ic a lly  best  systematic  linear  unbiased  estimator 
(ABSLUE)  of  6;    that  iE_, 

(4.34)  0(k,n)  =  0(k)  +  0^{n~^)    , 

where  e(k,n)  ls_  as_  defined  by  (4.22)  . 
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Proof;   That  0(k)  is  best  among  linear  unbiased  estlmatorf 
based  on  the  asymptotic  k-varlate  model,  follows  immediately 
from  application  of  the  generalized  Gauss-Markov  theorem. 

It  is  similarly  easy  to  show  that  it  is  asymptotically, 
as  n  — >  00  ,  equivalent  to  the  BSLUE  based  on  the  exact 
model,  as  Indicated  by  (4.3^ )• 

We  have 

l'T~^x^^^' 
(^•35)  e(k,n)  =  :^ , 

I'T"  1 


where,  by  (4.12) 


X^(l-X  ) 


(*-?6'        ^^-J  °nt-(u^)f^'(u.)  ^  "'"'"' 


or 


(4.37)  T  =  i  r  +  [0(n-2)]  , 

where  the  last  term  represents  a  k  x  k  matrix  each  of  whose 
elements  is  of  the  form  0(n   ). 

We  now  substitute  (4.37)  in  the  expression  for  T  in 
(4.35)  and  carry  out  all  operations  as  indicated,  using 
standard  rules  for  working  with  0(*)' 
From  the  inequality 


(4.38)  P{|X|  >  6}  <   g 


E|X| 
e 

(Markov's  Inequality  for  r  =1)  we  can  derive  the  relationship 
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X(n.)  =%(^l\l)  ' 


and  finally,  in  simplest  form. 


(4.39)  S^'"l'  °°"'"  ■ 


P 


Application  of  (4.59) j  together  with  the  standard  operations 
Indicated  above,  yields  the  desired  result  as  stated  In 
(4.34). 

THEOREM  4.4.  '^(k)  £s  defined  in  (4.33)  is.  asymptotically 
efficient  as  k  — >  oo ,  in  the  class  of  all  unbiased  estimators 
of  9. 

Proof ;   This  result  was  proved  by  Tlschendorf  (1955); 
cf.  discussion  above  in  the  proof  of  Theorem  3.3. 

Theorem  4.3  may  be  generalized  to  the  problem  of  efficiency- 
robust  estimation.   This  will  be  done  in  the  following 

THEOREM  4.5.   (AASLUE's)   Assume  that  the  conditions  of 
Theorem  4.3  hold  for  each  of  a  family  of  m  shapes 
A,  A  =  1,...,  m,  where  now 

(it.40)  Ex[^(n^)]  ="!+» 

and 

A  (1-A  ) 
Cov,[X,   ,,X,   ,]  =  i  P ^ J — 


'A^-(n.)>-(n.)^  -  n  'A  -  ,f  (,A  ,  ^  ^^(^A  ,  ^^ 
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Further 

let 

Then 

ItN-l."^'' 

(4.41) 

0„(k)  -      "i  ' 
^       I'N^l 

is  an  a 

svmpt otic ally  admissible  systematic  linear  xmbiased 

estlmat 

or  (AASLUE)  of  9;    specifically. 

(4.42)         eQ(k,n)  =  ^Q(k)  +  0p(n-2) 


Proof:   The  arguments  are  completely  analogous  to 
those  given  in  the  proofs  of  Theorems  4.2  and  4.3- 


77 


4.4.   Discussion 

As  is  seen  from  the  definition  of  the  respective 
estimators,  knowledge  of  the  exact  covariance  matrix  of  the 
sample  quant lies  for  the  pdf's  under  consideration  is  needed 
for  the  computation  of  BSLUE's  and  ASLUE's.   A  list  of 
available  moments,  other  related  n\;imerical  results,  and  a 
more  extensive  discussion  of  this  problem-area  are  found 
in  Blrnbaiom  and  Laska  (1967a,  b). 

The  estimators  based  on  the  asymptotic  covariance 
matrix  are  more  practically  applicable,  since  the  asymptotic 
form  is  easily  computed  for  arbitrary  given  pdf's.   The 
necessary  inversion  of  the  matrix  in  question  does  not  appear 
prohibitive,  since  here  we  are  dealing  only  with  a  subsample  of 
size  k,  and  there  are  indications  that  small  values  of  k 
already  yield  good  results.   This  is  an  important  practical 
advantage  of  this  type  of  systematic  estimator. 

As  shown  in  Theorems  4-3  and  4.5,  the  ABSLUE's  and 
AASLUE's  are  also  asymptotically  equivalent  to  the  corres- 
ponding exact-covarlance  estimators.   But,  full  asymptotic 
efficiency,  attainable  by  single-shape  estimators,  is  not 
attainable  for  the  mixture  model  even  when  the  full  sample 
and  exact  covarlances  are  used.   This  of  course  was  the 
reason  for  introducing  above  the  non-linear  estimators  of  the 
generalized  Pitman  type. 
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For  a  single  shape  and  the  full  sample  an  asymptotic 

form  of  0(n,n)  was  obtained  by  Danlell  (I92O),  Bennett 

(1952),  and  Jung  (I955).   The  problem  of  deriving  the  analogous 

asymptotic  form  for  the  mixture  model  by  Bennett's  method 

seems  formidable,  as  it  would  require  an  explicit  expression 

m        -, 
for  [>   g  P  ]~  .   However,  Jung's  approach  lends  itself 


A^l  ^^'^ 
adaptation,  as  Yhap  (I966,  I967)  has  shown.   In  the  next 


to 


chapter  we  will  discuss  the  latter  approach  as  it  applies 
to  systematic  estimation. 

The  need  for  numerical  investigation  of  these  estimators 
has  already  been  mentioned  in  earlier  sections. 
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Chapter  5 

SYSTEMATIC  LINEAR  ESTIMATORS  BASED  ON 
CONTINUOUS  WEIGHT  FUNCTIONS 

5.1.   Derivation  of  Jung's  Result  for  a  Single  Shape 

Under  Weakened  (Necessary  and  Sufficient)  Conditions 

As  before,  let  X-,,...,  X  be  a  set  of  Independent, 
Identically  distributed  random  variables  with  pdf  f(x  -  0) 
satisfying  (CI). 

In  the  last  section  we  mentioned  the  asymptotic  version 
of  Lloyd's  best  linear  estimators  obtained  by  Bennett 
(1952),  Jung  (1955),  and  (less  rigorously)  by  Daniell  (I92O), 


(5.1)         e(h)  ,igh(jA_)x,^,  , 


where  X/-,^  <  ...  <  X,  ^  are  the  order  statistics  derived 
from  X-,,...,  X  ,  and  h(u)  is  a  continuous  function  of  u 
defined  on  [0,1].   It  was  also  shown  by  these  authors  that 
0(h)  is  asymptotically  efficient  in  the  class  of  all  estimators 
of  Q,    under  the  given  shape. 

In  this  section  we  will  derive  Jung's  result  under 
weakened  conditions  and  indicate  in  what  sense  these  conditions 
are  necessary  and  sufficient. 

Let 

(5-2)  Zn'IglhlHTr'^d)  . 
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where 

(5.3)      E[Y(.)]  =  E[X(.)]  -  e   =  E[X(.)|e  =  0]  , 

i.e.,  we  are  considering  first  the  estimator  based  on 
standardized  variables. 

LEMMA  5.1.   Let  h(u)  be  continuous  and  bounded  on  [0,1]. 
Assume  further  that  G(u),  G'(u),  and  G"(u)  are  bounded  and 
continuous  and  G'"(u)  is  bounded  on  [0,1],  where 
G(u)  =  P^^Cu)  =  y.   Then 


(5.4)  E[Z^]  =  /    h(u)G(u)du  +  0(n~  ) 

'-'  o 
and 

nil  -2 

(5.5)  Var[Z  ]  =-/    /    k(u,v)h(u)h(v)G'(u)G'(v)dudv  +  0(n   ) 


where 

u(l-v)    u  <  V 


(5.6)  k(u,v)  - 

v(l-u)    u  >  V 


Proof ;   We  have 
(5.7)  E[ZJ  =iZ:h(H^)E[Y(,)]  . 

Since  the  assumptions  of  Lemma  4.1  are  fulfilled  in  this 
case,  we  have,  by  (4.8), 
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(5.8)   E[ZJ  =  i  ^  h(H^)(G(j^)  +  O(n-l)] 


i  Z^  Mnir'atHTr'  +Kf^h(^)  •  o(n-i 


Let 


then 


n+1   "^l  '     n   -^1 


1  =   1,...,  n; 


and  since  h(u)  and  G(u)  are  continuous  and  bounded  on  [0,1] 

the  Rlemann  Integral  exists,  so  that  In  the  limit 

1  -, 

(5.9)        E[Z^]  =  /    h(u)G(u)du  +  0(n"-^) 


since  also 


(5.10) 


Similarly, 


=  J_h. 


(5.11)  Var[Z^]    =  J^h'(n)    Cov[Y^ . ^ , Y^ ^ ^ ]h(n) 


where 


so  that 


h(n)    =   th(^),...,    h(5^)] 


8a 


(5.12)  VarlZj  '  ^ ^t^  '^i^)^idT^OonY^,yY^  j^] 


4  Z^  :^  '^(HTr"^(Hir)  \k  ^i^.   ;^)G'(^) 


where 


(5.1?)      H-^,  ^) 


nird-sTr''     ^*^ 


we  have  here  used  (^.9)  of  Lemma  4.2,  which  applies  by- 
virtue  of  our  assumptions.   By  the  same  arguments  as  before 
we  establish  the  existence  of  the  double  Riemann  Integral; 
that,  with  (5.10),  yields  in  the  limit 

1   1 


Var 


[Z  ]  =  i  /    /    k(u,v)h(u)h(v)G'(u)G'(v)dudv  +  0(n"^) 

^  o  ^  o 


as  was  to  be  shown. 

COROLLARY  5.1.   Let 

n 


{5-li() 


ff(h)  "  i  Z^  Mnir'^d 


then 

_      .  1  1  . 

(5.15)     E[e(h)]=  /    h(u)G(u)du  +  e   I        h(u)du  +  0{n   ) 
'-'  o  ^  o 
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and 


1   1 


{5.16)  Var[e(h)]  =  i  /    /   k(u,v)h(u)h(v)G'(u)G'(v)dudv  +  0(n~^) 
^  o  ^  o 


Proof;   (5.15)  follows  from  (5.2)  -  (5.4)  and  (5.16) 
Is  an  Immediate  consequence  of  (5-5) • 

We  will  now  derive  Jung's  result  for  the  special 
case  of  estimating  6,    with  f(x  -  6)    symmetric,  using  the 
results  of  Corollary  5.1. 

The  proof  is  essentially  that  of  Jung,  based  on  the 
calculus  of  variations.   However,  we  give  it  in  a  somewhat 
different  form  which  will  make  it  more  convenient  to 
apply  in  later  sections;  the  terminology  follows  that  of 
Yhap  (1967). 


THEOREM  5.1.   If  the  assumptions  of  Lemma  5.I  hold  and 


(5.17)  lD| 


Jo  ^o^^'"'  j,   ^l(^ 


dv 


/    h  (v)G(v)dv     /    h-^(v)G(v) 
^  o  ^  o 


dv 


^  0  , 


where 
(5.18) 

and 


ho(^)  =  GW^  [gTTTt] 
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^i(^)  -  GTTTr-2  LG^irrl 


then  there  Is,  In  the  class  of  continuous  weight  functions 
h(u),  one  that  uniquely  minimizes  the  asymptotic  variance 
(5.16),  namely, 

h.(u) 


{5.19) 


1 
/    h  (u)du 
^  o 


Proof:   Lemma  5-1  guarantees  the  existence  of  the 
asymptotic  mean  and  variance  of  0(h)  as  given  by  (5. 15) 
and  (5.16).   We  want  to  find  h  minimizing  the  functional 


1  r  1 


nVar[^(h)]  =  /    f       k(u,v )G ' (u)G ' ( v )h(u)h(v)dudv 
'-'  o  ^  o 


iubject  to  the  subsidiary  conditions 


1^  1^ 

(5.20)       /    h(u)du  =  1   and   /    h(u)G(u)du  =  0 


From  (5.15)  we  see  that  these  are  necessary  to  Insure  the 
(asymptotic)  unbiasedness  of  F(h). 
Writing  n  Var[0"(h)]  as 

(5-21)   /   K(v,h(v))dv  =  /   G'(v)h(v)[  f    k( u, v)G ' ( u)h(u)du| dv 


we  obtain  for  the  Euler  equation 

1 
(5.22)   [K]   =  G'(v)  /    k(u,v)G'(u)h(u)du  +  A^  +  A-lG(v)  -  0 

^  o 
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where  A^,  A^  are  the  Lagrange  multipliers.   Solving  for  h, 
we  have 

,1  -  A^   A.G(v) 

(5.23)      J        k(u,v)G'(u)h(u)du  =  ^^p^  -  Qt(v)  . 


n   V  -  1 

(5.24)  (1-v)  /    uG'(u)h{u)du  +  v/    (l-u)G'(u)h(u) 


_  -  A^    A-^G(v) 
~  G'{v)  ""   G'(v)  • 

Differentiation  with  respect  to  v  yields 

V  1 

(5.25)   -  /    uG'(u)h(u)du  +  (l-v)vG'(v)h(v)  +  /   ( l-u)G' (u)h(u)du 

-   v(l  -  v)G.(v)h(v)  =  -  X„  ^  [^]-   Ai  ^  Ij^]     . 

Differentiating  again,  we  obtain 


(5.26)     -  vG'(v)h(v)  -  (l-v)G'(v)h(v) 


-    X    ^2  r  1  1  (j2  rojv)] 


so  that 

^-     d^  r  1  ]  ^    ^1    d2 


,5.27,  Mv,  .^^-i^[^].^A^[^] 


To  solve  for  A^  and  A-j^,  we  need  the  constraints  (5-20) 
Substituting,  we  have 
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(5.28) 
and 


^o  /    h^(^)dv  +  A.  /    h  (v) 


dv  =  1 


..1  .  1 

(5.29)   Aq  /   h^(v)G(v)dv  +  A-^  /   h,(v)G(v)dv  =  0 

'^  o  ^  o 


In  matrix  form,  this  may  be  written  a 

1  .  1 

o  ^  o 

1  1 


(5.30) 


Jo      '°^^^'^       Jo      'l^^ 

1  1 

J        h^{v)G(v)dv    /    h-^(v)G(v) 


dv 


DA  =  a  , 
so  that 

(5.31)  A  =  D"^a  , 
since  by  (5.17)     |d|  7^  0  . 

It  has  been  shown  by  Yhap  (I967)  that  for  f(x)  symmetric 

(5.32)  /   h  (v)G(v)dv  =  0  . 

^  o 

Solving  for  A,  we  thus  obtain 


and 


/   h^(v)dv 


A-,  =  0  , 
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which  yields  the  desired  result  (5.I9). 

Uniqueness  of  the  solution  has  been  shown  by  Jung  and 
the  argument  will  not  be  given  here,  as  It  Is  not  pertinent 
to  our  present  considerations. 

It  has  likewise  been  shown  by  Jung  that  Var['5"(h)],  when 
h  Is  the  minimizing  function  h(u)  defined  by  (5.I9),  Is  the 
reciprocal  of  Fisher's  Information  for  9,    so  that  0*{h)  Is 
asymptotically  efficient  In  the  class  of  all  estimators. 
Asymptotic  normality  of  this  estimator,  required  for 
efficiency,  was  shown  by  Chernoff,  Gastwlrth,  and  Johns 

(1967). 

We  see  from  the  above  that  sufficient  conditions  for 
Theorem  5.I  consist  essentially  of  G'"(u)  being  bounded  and 
h(u)  Integrable  on  [0,1],  these  assuring  the  existence  of 
the  asymptotic  moments  In  question.   Inspection  of  (5'19) 
and  (5.18)  tells  us  that  these  conditions  are  also  necessary 
for  the  minimizing  function  h(u)  to  exist,  and  this  Is 
precisely  the  extent  of  our  new  results. 

Jung,  In  establishing  his  analogue  of  Lemma  5'1» 
required  the  existence  of  a  bounded  fourth  derivative  on 
h(u).   In  view  of  the  form  of  h(u)  as  given  In  (5.I9), 
his  condition  amounts  to  the  requirement  of  seven  bounded 
derivatives  on  G(u),  as  compared  to  our  three. 

In  his  concluding  remarks  Jung  pointed  to  the  desirability 
of  further  weakening  these  conditions,  allowing  h(u)  to 


tend  to  oo  at  u  =  0  or  u  =  1,  provided  that  remainder  terms 
for  the  asymptotic  moments  could  still  be  obtained  that 
approached  0  not  too  slowly.   Now  In  our  results  we  made  use  of 
only  the  weakest  form  of  Lemmas  4.1  and  4.2,  assuming 
condition  (b)  to  hold  for  all  of  c5  =  [0,1].   It  is 
possible  that  investigation  of  the  full  use  of  these  lemmas 
would  lead  to  results  along  the  lines  indicated  by  Jung. 


5-2.   Direct  Analogue  for  Systematic  Estimation  and  Arbitrary 
Spacing 

In  this  section  we  apply  the  method  of  systematic 
estimation  to  the  case  discussed  in  Section  ^.1,    deriving 
asymptotically  best  systematic  linear  unbiased  estimators. 
These  estimators  are  valid  for  any  k  and  arbitrary  spacing 
{A.};  f"o^  the  uniform  spacing  A.  =  i/(k+l),  1  =  1,...,  k, 
the  weight  function  h(u)  is  seen  to  coincide  with  the  one 
given  by  Jung  for  the  full  sample. 

These  results  were  obtained  already  by  Sarndal  (I962) 
by  a  different  method  (extending  the  approach  of  Bennett 
and  Ogawa ) .   Our  work  is  of  interest  here  in  that  it  yields 
the  same  results  in  a  simpler  and  more  direct  manner,  and 
by  the  only  approach  that  seems  to  be  adaptable  to  the 
problem  of  efficiency-robust  estimation. 

In  what  follows  we  will  make  use  of  a  generating  function 
f*(x)  for  {A.}  introduced  by  Sarndal  (I962),  which  will  also 
be  of  central  interest  in  the  next  chapter.   Our  definition 


of  f*{x)   will  be  specialized  to  the  case  of  interest  in 
this  paper. 

DEFINITION.   Let  f*(x)  be  a  function  continuous, 

r    OO 

boimded,  and  positive  Vx,  and  3  /     f*(x)dx  =  1.   Assume 

'-'  -oo 
further  that  f*'(x)  is  continuous  and  bounded  Vx.   Let 

(5.55)  F*(x)  -  /     f*(t)dt  , 

^  -00 

(5.34)  G*(t)  =  (F*)~^(t)  , 

where  G*(t)  is  continuous  and  nondecreasing,  and  /    G*(t)dt  <  00 

^  o 
Finally,  let 

(5.35)  ^i  =  kir  '    i  =  1,...,  k  . 

The  set  of  spacings  {A.}  is  then  defined  in  terms  of  f*(x) 
as  follows: 

(5.36)  Aj^  =  F[G*(7rj^)]  ,   i  =  1,...,  k  , 

where  F(x)  is  the  cdf  of  the  random  variable  X.   Clearly 
A.  <  A.  -,,  i  =  1,...,  k-1.   We  shall  say  that  the  spacing 
(5.36)  is  generated  by  f*(x),  the  "generating  fxmction" 
of  {A^}. 

LEMMA  5.2.   Given  a  set  of  k  sample  quant iles 

X,   X, ,  X,   >,  determined  by  (A.},  and  a  function  h(u) 

[n-^j  ^^k 
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continuous  and  bounded  on  [0,1] ,  define  the  systematic 
linear  estimator 

(5.37)  \  =  ^^^(\)^^n^)    > 


where 


^tY(n.)]  =^f^(n.)^  -  '   =E[X(n^)i^  =0], 

i  =  1,...,  k  . 

Assume  further  that  G(u),  G'(u),  and  G"(u)  are  bounded  and 
continuous  and  G"'(u)  Is  bounded  on  [0,1].   Then 


pi 
o 
and 

1     r  1 
o 


(5.38)        E[Z^]    =    /        h(u)G(u)G'(u)f*[G(u)]du  +  OCn"-") 


(5.39)      Var[Z^]    -  i. j       j        k(u,v)h(u)h(v)[G'{u)]' 


•    [G'(v)]^f*[G(u)]f*[G(v)]dudv   +  0(n"^)    , 

where  f*(x)  ls_  defined  by  (5.36),  and  k(u,v)  by  (5.6). 

Proof;   The  proof  Is  quite  similar  to  that  of  Lemma  5.I, 
In  view  of  the  assumptions  on  G(u),  (4.11)  and  (4.12)  are 
valid,  so  that  we  obtain 


91 


5.40)        E[ZJ    =^IZh(^)E[Y(,^)] 


1      ^  1 

iy-  h(A.)[G(A.)    +  0(n   "^ )  ] 


^y~  h(A,  )G(AJ    +  0{n-^) 


i'^^  'i 


i:^h[F[G*(^)]]G*(i7ir)    +0(n-l)    . 


=  feS 


k+l'-"^      ^k+1' 


Let 


then 


^=Ay.    ,        klr-^i,        1=^1^        i=l,...,k; 


and  by  the   assumptions    on  h( • )    and  G*(')    the   Riemann  integral 
exists,    so  that  we   obtain  in  the   limit 

(5.41)  E[Z^]    =    /         h[F[G^(v)]]G*(v)dv    +  0(n"^)    . 

Now   let 

F[G*(v)]    =   u 

G*(v)    =  G(u)    =  X 


and 


dv        dF*        dG       IlU)    _  rifn^f*rrrnn 


so  that    (5-''^l)   becomes 
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(5.^2)   E[Z^]  =  r   h(u)G(u)G'(u)f*[G(u)]du  +  0(n~^). 

Similarly,  by  (4.12), 

(5.43)   Var[Z^]  =  ^  |^  5^  h(Xj^)h(X  .  )Cov[Y,^^,  ,Y(„^  , 


k   k 


^iZ^   h(A.)h(Aj)[k(A^,Aj)G'(A^)G'(Aj)+0(n-2)], 


where 


(5.44) 


nk 


k(A.  ,A  .)=< 


A^(l-Aj)   i  <  j 


^Aj(l-A^)   i  >  J 


Substituting  (506)  for  A.  as  before,  and  by  similar 
arguments  for  the  existence  of  the  double  Rlemann  integral, 
we  obtain  in  the  limit  as  k  — >  oo 

(5.45)  Yar[Z^]    =^J      J        h[P[G*(  s)  ]  ]h[F[G*(t )  ]  ] 


•  k[F[G*(s)],F[G*(t)]]G'[F[G*(s)]]G'[F[G*(t)]]dsdt  +0(n"^). 


Performing  the  change  of  varlablef 

F[G*(s)]  =  u 
F[G*(t)]  =  V 


and 


now  yields  the  desired  result. 
COROLLARY  5-2.   Let 

(5.46) 


^(h,k)  =^g^h(A,)X^^_, 


93 


Then 

1 
(5.47)    E[^(h,k)]  =  /    h(u)G(u)G'(u)f*[G(u)]du 
^  o 


1  _n 

+  9    I         h(u)G'(u)f*[G(u) ]du  +  0(n  ^) 


o 


J 

and  Var  [F{h,k)]  is  given  by  (5-39). 

Proof:   The  proof  Is  Immediate  from  (5-37)  -  (5- 39 
COROLLARY  5.3-   Assume  that  the  spacing  Is  uniform 


I.e. 


(5.48)  A,  =  i-i^  ,      1  =  1,...,  k 


1 

Then 

1 


k+T 


(5.49)   E[e(h,k)]  =  /   h(u)G(u)du  +  6   I        h(u)du  +  0(n   ) 


49)  E[U{h,i<i)  \   =  I        hlu)G(u}du  +  y  /   niu;au  +  u^n 
and 

(5.50)   Var['5"(h,k)] 

,11  -, 

=-1  k(u,v)h(u)h(v)G'(u)G'(v)dudv  +  0(n 

'^  ^  o  ^  o 

as  given  by  Jung  for  the  full  sample. 
Proof:   By  (5-56) 
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=>  G(u)  H  G*(u) 

=>  G'(u)f*[G(u)]  =  1 

and  the  result  follows  from  Corollary  5-2. 

Next  we  have  a  theorem  yielding  the  asymptotically 
best  systematic  linear  estimator  of  6   based  on  a  continuous 
weight  function  and  arbitrary  given  spacing  {A.]. 

THEOREM  5-2.   If  the  assumptions  of  Lemma  5-2  hold  and 

)1)  |D|  = 

j        h  (v)G'(v)f*[G(v)]dv      /   h-^(v)G'{v)f*[G(v)]dv 

^  o  ^  o 

/    h  {v)G(v)G'(v)f*[G(v)]dv   /    h  (v)G(v)G'(v)f*[G(v)] 

J  o  ^  o 


^   0 


dv 


where 


(5.52)       h  (v)  =  i 2-^Ig4 


f*[G(v) ][G'(v)]  dv 


^ 


,  ,  ,   1 d^  r  G(v)l 

^^^'^    =   f^[G(v)][G.(v)]'^  ^  1^^ 
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then  the  continuous  weight  function  h(u)  that  uniquely 

minimizes  the  asymptotic  variance  (5-59)  2L  ^(h,k)  Is 

given  by 

h^(v) 
(5.55)        h(v)  ^  2 . 


/    h  (v)G'(v)f^[G(v)]dv 


Proof:   We  want  to  find  h  minimizing  the  functional 


(5.54)   n  Var[F(h,k)] 

=  /    /    k(u,v)h(u)h(v)[G'(u)]2  [G'(v)]2 

^  O  '-'  o 


•  f*[G(u)]f*[G(v)]dudv 


subject  to  the  subsidiary  conditions 

1 
(5-55)  /    h(u)G'(u)f^[G(u)]du  =  1 

^  o 
and 

,  1 

/    h(u)G(u)G'(u)f*[G(u) ]du  =  ( 
^  o 


The  method  is  completely  analogous  to  the  one  used  In  the 
proof  of  Theorem  5.1. 

It  is  easy  to  check  that  F(h,k)  is  Identical  to  the 
asymptotic  form  of  the  linear  unbiased  estimator  obtained 
by  Sarndal  (I962),  p.  76.   He  gives  the  result 


7;^(U^)f(U^) 

(5.56)      11m   kg   -  — — ^^.       V 
k->oo    ^^  ^11^  ^1^ 


where 


u^  -  G(X^) 


.^U,  .  ^2|^lki  , 


-  prS  log  f(x)n2 
11  -  ^^    M  ^ 


and  g-,  .  is  the  coefficient  of  the  ith  sample  quantile 
X,   \,  i  =  1,...,  k.   Substituting  the  continuous  variable 
X   for  u. ,  we  obtain  the  correspondence  of  (5.56)  with  our 
weight  function  (5.53)-   We  note  further  that,  in  view  of 
Corollary  5.3^  the  weight  function  for  the  uniform  spacing 
coincides  with  that  given  by  Jung  for  the  full  sample. 


5.5.   Application  to  Mixture  Model 

We  will  next  consider  the  problem  of  constructing  an 
efficiency-robust  analogue  of  F(h,k),  i.e.,  an  asymptotically 
admissible  systematic  linear  unbiased  estimator  (AASLUE) 
of  6.      The  corresponding  ABSLUE  was  obtained  in  Theorem  5.2 
by  solving  for  the  function  h(u)  that  minimizes  Var  [F(h,k)]. 

From  the  discussion  of  the  method  of  constructing  efficiency- 
robust  estimators  in  Section  2.2,  and  in  particular,  from 
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(2-9),  we  see  that  an  asymptotically  admissible  estimator 
F  (h,k)  may  be  obtained  by  minimizing  Var[F^(h, k) | G] ,  where 


(5.57)      Var[F^(h,k)lG]  =  211  Sa  ^^^f^G^'^'^^l^^  * 

A=l 


The  model  is  the  usual  one:   we  assume  given  m  densities 
f(x  -  e|A),  A  =  1,...,  m,  a  distribution  G  =  {g^,...,  g^^}  , 
0  <  g,  <  1  V^'  ^"^^   ^   ^^^    °^   ^  sample  quantiles  X,  .,...,    X,   ^ 
We  assume  that  all  regularity  conditions  introduced  earlier 
are  satisfied  when  necessary. 

Thus  we  need  to  solve  for  h  minimizing 

m      ;  1  ,  1  p        p 

(5.58)   IZ  Sa  /    /    k(u,v)h(u)h(v)[GJ^(u)]^[G^(v)]^ 
A=l    ^  o  ^  o 


f*[G^(u) ]f*[G^{v)]dudv 


where 


G^(u)  =  F^^(u)  =  X 


and 


As  we  see  from  Corollary  5.3,  for  the  uniform  spacing 
A.  =  l/(k+l)  the  problem  coincides  with  that  of  obtaining 
an  asymptotically  admissible  linear  unbiased  estimator 
(AALUE)  of  6,    that  is,  the  corresponding  estimator  for  the 
full  sample  size  n.   Using  the  same  weight  function  h(u) 
we  have  then 


(5.59)  e^(h)  =i|;^h(jir)X(i, 

and 

(5.60)  S-^(h,k)  =i^h(iA_)X(^^^  . 


The  problem  of  obtaining  the  AALUE  has  been  solved  by 
Yhap  (1966,  1967),  and  is  thus  directly  applicable  to  our 
situation  for  the  particular  spacing  mentioned  above. 

The  choice  of  optimum  spacings  for  any  given  k  will  be 
treated  in  the  next  chapter;  however,  further  numerical 
work  may  corroborate  the  early  indication  that  a  rather 
simple  method  of  spacing,  such  as  the  uniform,  will  yield 
efficiencies  almost  equal  to  those  of  the  "optimum"  spacing. 
If  an  Increase  of  precision  is  desired,  the  possibility  of 
slightly  increasing  k  may  be  considered  more  attractive 
than  improvement  in  the  spacing.   Thus  the  estimator  for  the 
uniform  spacing  is  of  particular  interest  here. 

In  view  of  this  fact  we  will  briefly  outline  Yhap's 

method,  but  without  giving  explicitly  here  the  somewhat 

lengthy  statement  of  the  iinderlying  regularity  conditions. 

The  method  entails  minimizing  with  respect  to  h  the 

functional 

rn      ,  1    1 
(5.61)     ZZSa/    /    k(u,v)h(u)h(v)G'(u)G'(v)dudv 

subject  to  the  subsidiary  conditions 
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1 


(5.62)  /        G    (u)h(u)du  =  cx^    ,      A   =  0,    1,...,    m 

•J   n 


where 


Let 


G^{u)    =  1    ,      a^  =   1    ,      and     a     =  0     for     A  ^  0    . 


(5.65)  ^=   {J|J   =   J(u)    =  JZ  ^A^A^'^^    '    ^A   ''^^^'    A    =   1,...,    m} 

A=l 


and   define  the   following  bilinear   functional   on  '^'^V'- 

11  ,  , 

(5.64)  (Ji^"Jj)h  "   /        /        k(u,v)J^(u)Jj(u)h(u)h(v)dudv  , 

where  J.,  Z  .  c   7^ ,    and  k(u,v)  Is  as  defined  by  (5.6).   Yhap 

shows  that  under  certain  regularity  conditions  (J., J.),  Is 

a  real  Inner  product  on  ^  -<  2^,  and  then  he  orthonormalize^ 

G, ,  A  =  1,...,  m,  with  respect  to  this  : 
A 

result  Is  a  set  of  orthonormal  functions 

J-1 
Gj(u)  -  2^  (Gj,J^)J^(u) 

(5.65)  J,(u)  =  i^^T ,  j  =  1,...,  m  , 


where 


and 


|!Gj(u)  -  ;^  (Gj,J.)J.(u) 


(G.,J,)  .  (G.,J.)^ 

Fll  =  [(J,J)h]^^^  • 


Thus  we  can  write 


100 


(5.66)  J  =  •^G 

where 

J  =  [J-l(u),...,  J^{u)1 


and 


so  that 


G  -  [G-l(u),...,  G^(u)J   , 

A  =  t^ii^   i^  ^  non-singular  matrix. 


(5.67)  G  -  B J 

where 


[b,,]  =  «-i 


-J 

Yhap  next  shows  that  the  fiinctlonal  (5.6I)  to  be  minimized 

can  be  expressed  In  terms  of  these  orthonormal  fimctlons; 

I.e., 

m      ,-  1  -  1  ,  , 

(5.68)  y~g,  /  G,(u)h(u)k(u,v)G.(v)h(v)dudv 

T^  1 J  o  ^  o  ^  ^ 


1  '  1  m     ,  m 

y~   c  J  (u)h(u)k(u,v)Z 
p-1   ^  ^  q= 


/    /   2Z  c„J„(u)h(u)k(u,v)XZ  c„JQ(v)h(v)dudv  , 
^  o  ^  o  p^l  ^  ^  q=l   ^  ^ 


where 


By  a  method  completely  analogous  to  that  given  In  the  proof 

of  Theorem  5.I,  we  now  obtain  as  the  function  minimizing  (5.68) 
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(5.69) 

ho(-) 

2 

1 
m             , 

and 

(5. 70) 

h    (v) 

J'     ^'^ 

)dv 

The  basic  problem  now  of  course  Is  that  the  inner  product 
(J.,J.)u  with  respect  to  which  orthonormalization  takes 
place  is  itself  a  function  of  h,  so  that  we  can  obtain  a 
solution  only  by  iteration.   Such  an  iterative  procedure  has 
been  developed  in  detail  by  Yhap. 

In  order  to  generalize  this  approach  to  obtain  a  contin- 
uous weight  function  for  an  arbitrary  fA.}j  one   must  show 
that  the  expression  suggested  by  (5.58),  namely, 

{5.71)   /    /    k(u,v)h(u)h(v)[G  (u)]^[G  (v)]^ 

•  f*[G^(u)]f*[G.(v) ]dudv  , 

is  an  inner  product.   Orthonormalization  and  an  iterative 
procedure  analogous  to  Yhap's  should  then  yield  the  desired 
solution.   The  author  plans  to  carry  out  this  work  in  the 
near  future. 
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Chapter  6 


THE  PROBLEM  OF  OPTIMUM  SPACING 

6.1.   Ogawa's  Optlmuin  Solution  for  a  Single  Shape 

Fisher's  Information  for  the  asymptotic  k-varlate 
normal  model  defined  above  In  (3-11)  Is  of  the  form 
k    [f(u.,.)-f(u  )]2 

(6.1)  K(k|L)=xi  — r^ — - — ' 

i^     ^1+1^1 
where  we  use  L  to  specify  the  given  spacing,  I.e., 


(6.2)  L  =  (A-^,...,  ^^)    . 


This  expression  was  Introduced  In  Section  3.4  (Eq.  (5.34)) 
In  the  context  of  the  mixture  model  as  IC(k),  the  Information 
corresponding  to  shape  A.   There  we  also  saw  the  relationship 


(6.3)         K(k|L)  -  I'Sl 


n  Var[e(k)] 
where 

Is  the  ABSLUE  of  G    (as  shown  In  Section  4.3),  the  estimator 
studied  by  Ogawa  (1951)- 

Ogawa  defined  as  optimum  spacing  the  vector  L  ^   (A-j^,...,  A^) 
that  maximizes  (6.1),  the  Information  with  respect  to  9 
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(or,  equlvalently,  minimizes  the  variance  of  the  ABSLUE  of  6). 
Differentiating  (6.1)  with  respect  to  A.,  we  obtain 


(6.4)  ^^^ 
i 


f   -f 
^i+1  ^ 


^1+1"^ 


1  .  'i-'i-i^  f  fitLlfi 
1   ^i~^i-iy  v^i+l"^i 


^1-^1-1   2f :  N 
\-\-i    ^1 J 

=   0 

1  =  1,... 

,    k   , 

where 


^i  =  ^("i)-   <  =  ^ 


u=u. 


Tischendorf  (1955)  gave  regularity  conditions  for  the  pdf  f 
under  which  a  necessary  condition  for  L  to  maximize  (6.1) 
is  that 

f.  ^, -f .    f.-f .  ,    2f  ! 

^1+1^1     ^1  \-i      ^1 

This  had  already  been  shown  for  the  normal  distribution  by 
Ogawa,  who  also  proved  that  in  this  case  the  system  of 
equations  (6.5)  has  one  and  only  one  solution  and  that  the 
optimum  spacing  is  symmetric  (cf.  Ogawa  in  Sarhan  and 
Greenberg  (I962),  p.  273)- 

Numerical  solutions  for  the  normal  case  were  obtained 
by  Ogawa  for  k  <_  10,  and  the  resulting  relative  efficiencies 
E(k|L),  where 
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,6.6,       E,K|L)=J%^,     I  =  E[ai°;f(^']g  , 

were  compared  with  those  obtained  by  using  the  approximate 
spacing 

(6.7)  \  '^    ■ 

Ogawa  also  computed  the  best  spacing  for  the  exponential 
distribution  for  k  <_  I5.   All  these  results  are  tabulated 
in  Sarhan  and  Greenberg  (I962). 

Sarndal  (I962)  gives  X.  =  i/(k+l)  as  the  optimum 
spacing  for  the  logistic  distribution.   A  proof  of  this  and 
related  numerical  results  are  found  in  Gupta  and  Gnanadesik? 
(1966). 

Optimum  spacings  do  not  seem  to  be  known  for  other 
families  of  distributions.   It  is  interesting  to  observe 
from  these  results  that  (a)  the  approximate  spacing  (6.7) 
compares  very  favorably  with  the  optimum  one,  and 
(b)  high  efficiencies  (>  .9)  are  obtained  already  for  small 
values  of  k(4  <  k  <  10). 


6.2.   Sarndal 's  Nearly  Optimum  Solution  for  a  Single  Shape 

In  view  of  the  extreme  difficulty  of  solving  for  the 
optimum  spacing,  Sarndal  (I962)  proposed  an  approximate 
solution  expressed  in  terms  of  the  generating  function  f*(x) 
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defined  in  Section  5-2.   His  nearly  optimum  spacing,  or 
equivalently,  the  generating  function,  is  obtained  by  using 
an  asymptotic  expansion  and  a  limiting  integral  involving 
the  relative  efficiency  E(k|L)  defined  in  [6.6)    and  by 
calculus  of  variations  solving  for  the  function 
f*(x)  =  f*(x)  that  ma:ximizes  the  efficiency. 

Briefly,  following  Sa'rndal  (Chapter  4),  we  have 

,6.8)       E(k|L)  =  aM£i .  1  -  i-yii-)  . 

Expanding  both  I  and  K(k|L)  in  Taylor  series,  Sarndal  obtains 
for  the  difference 

(6.9)  I  -  K(k|L) 

=  ^IZ   (?^i+i->^i)^ff"[G(A.)]}2  +  0[(A.^^-A^)'^]  . 

Further,  we  have  the  relationship 

(6.10)  A^^3_  -   \   =   F[G*(7r.^^)]  -  F[G*(7r^)] 

-,    f(u  )        p 

This  follows  from  the  (extended)  mean  value  theorem  for 


(6.11) 


f(u.) 


dF       _  

dt  It-TT^   f*(u^)  • 
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Substituting    (6.9)    and    (6.10)    In    (6.8),    we   obtain 

,  k      pf(u.  )-|-z  P  u 

(6.12)    E(k|L)    =   1 ± ^T~     f*nrT  ^^"  [G(A.)]}^   +  0(k-^) 

121  (k+1)^   1=0    L^    ^^1'-^  ^ 


k      [7'(u.)]^f(u    )  u 

1     jYZ     ,    "      _.  "    +o(k-^ 


12I(k+l)^   1=0        [f*(u^)]^ 

/      L,  /       \  S      log     f  (X  )  N 

(where  yix)    =  ^| ^—^) 


12I(k+l)'^^ -00         [r*{x)r 


if  the  Integral  exists. 

Up  to  this  point  we  have  assumed  that  all  necessary 
regularity  conditions  on  f  are  satisfied,  and  the  Integral 
Is  obtained  by  arguments  similar  to  those  given  In  the 
proof  of  Lemma  5.1.   If  the  Integral  In  (6.12)  does  not 
exist,  Sarndal  considers  the  alternative  expression 

(6.13)  E(k|L,e)  =  1  -  e  -  —^-^  \  I   '^"-"'   l2Ji20l!|(20    1 

12I(k+l)'^  LJg(5)      [f*(x)]2      IJ 

■2, 


(k  '^) 


with  e  >0,  e,  arbitrarily  small  and  5  =  5(e)  small  and 
positive. 

Thus  by  (6.12)  or  (6.I3)  the  relative  efficiency  is 
expressed  explicitly  as  a  function  of  k  and  the  spacing  A^^ 
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(via  f*(x)).   We  see  that: 

(a)   For  fixed  f*(x),  we  can  Increase  the  efficiency  by- 
increasing  k;  the  efficiency  tends  to  1  (6.12)  or  1  -  e 
(6.13).   e  is  the  loss  of  efficiency  caused  by  cutting 
off  the  extremes  of  the  range. 

(h)      For  fixed  k,  we  can  maximize  the  efficiency  by  a  choice 
of  f*(x)  that  minimizes  the  integral  in  (6.12)  or  (6.13). 

In  either  case  the  Integral  is  minimized  subject  to  the 
subsidiary  condition  /^  f*(x)dx  =  1. 

Straightforward  application  of  the  calculus  of  variations 
yields 

(6.14)  f*(x)  =  i{[r'(x)]2f(x)}l/5  , 


where 


c  =  /  °°{[7'(x)]2f(x)} 
>-'  -nn  . 


i/'d^  . 


Sarndal  classifies  fjjj(x)  as  Type  I  if  the  integral  in  (6.12) 
exists;  otherwise  it  is  said  to  be  of  Type  II.   It  evidently 
yields  a  nearly  optimum  solution  since  it  is  based  on  an 
approximate  expression  for  the  efficiency  E(k|A),  but  the 
latter  still  tends  to  at  least  1  -  e  as  k  — >  oo ,  where  e 
is  arbitrarily  small. 

The  nearly  optimum  generating  function  was  obtained 
by  Sarndal  for  a  number  of  distributions,  including  the  cases 
studied  numerically  by  Ogawa.   Tabulated  comparisons  of  relative 
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efficiencies  for  different  values  of  k  Indicate  good 
results  for  these  nearly  optimum  spaclngs. 

In  a  number  of  Instances  f  (x)  takes  a  simple  form. 
For  example,  for  the  normal  distribution  with  only  the  mean 
6   unknown,  we  have 

/OTT 

Which  generates  the  nearly  optimum  spacing 


Xl=F[GJ^^)] 


=  i[v/3  i-l(irj^)]  ,    1  =  1,...,  k  . 


Sarndal  remarks  that  such  simple  expressions  for  the 
generating  function  are  the  exception  and  not  the  rule, 
but  points  out  that  even  numerical  Integration  and  tabulation 
are  in  general  easier  to  perform  than  solving  the  system  of 
equations  [6 A)    for  the  optimum  spacing,  which,  moreover, 
has  to  be  done  separately  for  each  value  of  k. 


6.3.   Admissibility  Concept  for  Mixture  Model 

Given  a  family  of  m  shapes  f(x|A),  X,  j  e  A,  define j 
as  in  (3.3^) 


109 


k  [f      -  f   1^ 
(6.15)      K^(klL)  -  ^^-Aii±^_ ^^Al_ 


where 
and 

Furthermore,  let 

K,  (k|L) 

(6.16)  E^(k|L)  =  -Tjyj-  , 

where 

For  any  k  the  optimum  spacing  for  shape  j,  say 

(6.17)  L°=(X°j,...,^°j)  , 

Is  obtained  by  maximizing  (6.I5),  yielding 

K.{k|L°) 

(6.18)  Ej(!c|L°)  =  ^^j^  , 

the  efficiency  of  the  optimum  spacing  L°.   Then  the  relative 
efficiency  of  any  other  spacing  Is  given  by 

E  (k|L)    K.(k|L) 

(6.19)  Rej(k|L,L°)  =  ,J..  ,  =  -I 


E°{k|L°)    Kj(k|L°: 


We  are  Interested  In  obtaining  a  spacing  L  which  In  some  sense 
■1.S  optlmiom,  say  admissible,  over  the  m  shapes  f(x|j),  J  e  A. 
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LEMMA   6.1.      A   spacing  L*   =   (A^,...,    A^)    Is   admissible 

over  A    lf_  lt_  ma:?;lmlzes  >        g  .K  .(kJL) ,    i.e. ,    If 
J=l      -^    -^ 

m  m 

(6.20)  KQ(k|L*)    =  J—  gjKj(k|L*)    =   sup  ^  s.K.{k\L)    . 


Proof:      AssxAine  L*   Is   not   admissible   over   j.      Then    ^ 
spacing  L'   s 

Kj{k|L')    >  Kj(k|L*)     V  J 

with  Inequality  strict  for  at  least  one  j.   But  then 
m  m 


^gjK.(k|L.)  >^ejKj(lc|L.)  , 


contradicting  (6.20). 

This  definition  of  a  spacing  admissible  over  A  applies 
analogously  to  the  nearly  optimum  solution  proposed  by 
Sarndal. 

Regardless  of  how  L*  was  obtained.  It  can  be  evaluated 

by  the  expression 

K.(k|L*) 
(6.21)       Re,(k|L*,L°)  =  -=^ — —  <1. 


K.(k|Lp 


As  pointed  out  In  Section  6.1,  for  the  case  of  a  single 
shape,  maximizing  K(k|L)  Is  equivalent  to  minimizing  the  variance 
of  the  corresponding  estimator  (ABSLUE)  of  6.      This  analogy 
does  not  hold  for  the  mixture  model,  since  In  general 


111 


m  m        -, 

(6.22)      XI   g.K.dclL)  ^   lUZI  Sin-]    1  ' 

where  the  second  term  Is  the  reciprocal  of  the  variance 

(times  n)  of  the  corresponding  AASLUE,  as  defined  In  Theorem  4.5, 

Here  we  deal  exclusively  with  the  concept  of  Information 
based  on  the  k-varlate  normal  model,  and  consider  It  In 
various  forms.   This  approach  Is  applicable  to  other 
estimators,  such  as  the  systematic  Pitman  estimator  discussed 
In  Chapter  J>,    for  which  also  a  spacing  must  be  provided. 

In  practice  usual  computational  procedures  can  be 
applied  to  determine  admissible  and  maxlmln  estimators, 
with  the  class  of  estimators  extended  so  as  to  include  those 
based  on  all  possible  spaclngs. 


6A.      Admissible  Spacing  Based  on  Exact  Solution 

The  problem  is  to  solve  for  L  maximizing 

m 
(6.23)  Kg(k|L)  =  J~   gjKj(k|L)  . 

A  necessary  condition  is  that  L  satisfy  the  system  of 
equations 
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aK^(k|L)    ji_  SK.(k|L) 


(6-2^)  -^3a— =  ?=;^J-^ 


0 


1  =  1,..  .,  k  . 

The  problem  of  how  to  solve  for  L  (questions  concerning  the 
existence  of  any  simplifying  necessary  conditions,  uniqueness 
of  the  solution,  or  ways  of  finding  ths  Tiay.lnizlng  set) 
remains  open. 

Evidently  this  is  a  much  more  complex  situation  than 
the  one  involving  only  a  single  shape.   Writing  (6.4)  simply  as 


(6.25)  x^(L)y^(L)  =  0  ,     1  =  1,...,  k  , 


Tischendorf ' s  regularity  conditions  on  f  guarantee  that 

x.(L)  j^   0  for  any  1,  L,  and  this  yields  the  necessary  condition 

(6.5),  i.e., 

(6.26)  y^{L)  =  0  ,    1  =  1,..-,  k. 

The  corresponding  system  of  equations  for  the  mixture  model 
is 
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(6.27) 
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and  assuming  that  x^.(L)  ^   0  for  any  i,  j,  L  does  not  yield 
any  Immediate  simplification,  since  x^.(L)  and  yij(L)  may 
both  assume  positive  and  negative  values. 

Examining  this  system  for  specific  families  of  shapes 
and  small  values  of  k  would  be  a  start  In  attacking  the 
problem,  unless  a  large-scale  numerical  Investigation  were 
desired. 


6.5.   Another  Possible  Approach 

Given  the  optimum  spacing  L°  for  each  shape  j  e /\  ,    It 
Is  possible  and  plausible  that  the  following  relationship 
holds : 


(6.28) 


mln  A°.  <  A.  <_  max  A°  .  ,    1  =  1,...,  k  , 

.1    ^^  ^    J    ^ 


where 
(6.29) 
and 
(6.30' 


Kj(k|L°)  =  sup  KjCklD 


K_-(k|L*)  =  sup  XZ  g.K.(k|L) 
^  L   j=l   J  J 


L*  =  {A-,,...,  A,  )  Is  the  admissible  spacing  corresponding  to  G. 
If  (6.28)  Is  true  and  L^  Is  actually  known  V  j,  a 


lli^ 


suitable  approximation  to  L-*  would  be  given  by 


m 

(6.31)  L.  =  y—  g,L° 


i.e., 


m 
A.  n    =   J~   g.^?.  .     i  =  1,...,  k 


^i,G  -fzr-j-iJ 


which  clearly  satisfies  (6.28): 

(6.32)      min  A?  .  <  A.  „  <  max  A?.  ,    i  =  1,...,  k 


Briefly  the  problem  might  be  stated  as  follows: 
Given  a  set  of  real-valued  functions  K.(L)  whose 
argument  is  a  vector  L  =  (A-,,...,  ^i,)>  such  that 
0  <  A-,  <  ...  <  A,  <  1,  let  L.  be  the  vector  maximizing  the 
function  K.(L).   Under  what  conditions  is  it  true  that  the 
vector  L*  maximizing  a  given  convex   combination  of  the  K.(L) 
is  itself  contained  (component-wise)  in  the  convex  hull  of 
the  L°? 

An  example.   Consider  the  example  of  two  symmetric 
shapes,  the  normal  and  logistic  distributions,  for  k  =:  3. 
We  have  A^  =  1  -  A  ,  Ap  =  .5,  so  that  we  need  find  A-j^  only. 
We  know  that 

^11  =  --^^^   (normal) 

A°2  =  -25    (logistic) 
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and  here  we  can  show  numerically  that 
.165  1  A*(g)  <  .25 

by  considering  solutions  of  (cf.  Eq.  (6.24)) 

.4-f,    f-,   .4-f-,    f. 
g(T5^  -  iq^^T^  ^   A7  -  2u,)  .   (l-g)(.25-A,)  ^  0  . 

As  g  varies  from  1  to  0,  'h-^   sweeps  out  all  values  from 
.163  to  .25. 


6.6.      Admissible  Spacing  Based  on  Approximate  Solution 

Applying  the  concept  of  an  admissible  spacing  to 
Sarndal's  nearly  optimum  solution,  we  might  consider 
maximizing 

m  m     K  (k|L) 

(6.33)   E^(k|L)  .  ^  6/.(k|L)  .  ^  g.  -^^^y- 

m    r    I(j)-K,(k|L)- 


1    r   iu;-^ai^i^n 


+  0[(A,^,-^,)'*]  , 
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where  we  have  made  use  of  (6.9),  adding  the  subscript  for 
shape  j . 

Here  (6.10)  Is  of  the  form 

(6.3M     A,^,-A.  =^!i!^  +  o(k-2)  , 

1  J.  I  u^  J 


u  J  =  G  .  ( A .  ) 


where 

1    J   1 
30  that  (6.33)  become 


(6.55)  E„(kL)=i — I — Yz 4^11:1-4--^^ 

^  12(k+l)^  J^^^^^  lf^(uJ)-J 


[f"(uJ)]2  +  o(k-^) 


.  1  _     1     ^  Sj   ^  [yj(uj)]^f^.(uj) 
12(k+l)^  j^  ^^^  I^    [f*(u^)]5 

+0(k-^) 
^  log  f  (x) 
(where  7.(x)  =  ^ 


ax 

„   .  -1 

I     . 

1             ^     ^3       /   °°     f7>)]^f_^.(x) 

12(k+l)5    j^  I(j)^-oo         [f*(x)]2 

+   o(k-2)    , 

Ox 


if  the  Integral  exists  Vj- 

If  the  integral  does  not  exist,  an  expression  analogous 
to  (6.13)  may  be  defined- 
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Next  we  need  to  minimize  the  Integral 

,  00   m  g  [7'(x)]^f  (x) 
{6.^6)  /    JZ^-^ *      ^-g  ^ 


.00  j=l   I(J)[fj(x)]' 


but  here  the  correspondence  with  the  single-shape  problem 
breaks  down,  since  we  are  dealing  with  m  generating  functions 

Assuming  that  a  set  of  functions  f  .(x)  were  found  that 
jointly  minimizes  {6.J>6),    this  set  would  yield,  by  definition, 

(6.57)  A^j  =  Pj[G*(7r^)]  ,    1  =   1,...,  k  , 

i.e.,  m  distinct  spaclngs  L.,  whereas  we  need  one  spacing 

to  determine  the  sample  quantlles  to  be  used  in  the  estimator. 

One  way  to  get  around  this  problem  is  to  allow  only 
one  generating  function  to  be  independently  selected  which 
then  determines  the  others. 

Without  loss  of  generality,  assume  that  f-j^(x)  determines 
the  spacing.   Then 


1 ,  k 


(6.58)  \   =   Fi[Gi(7r^)] 

but  let  also 


(6.39)  A^  =  Fj.[G*(7r^)]  ,    j  =  2,...,  m  , 

so  that 
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G*(7r^)  =  Gj{P^[G*(Tr^)]}  ,   i  ==  1,  .  .  . ,  k,  j  -  1,  .  .  . ,  m, 
from  which  we  may  obtain  f  .(x)  =  fqrn — T  •   Substituting 

(6.40)  fj(x)  =  ^ 

^lff*(x)] 

in  (6.36),  we  obtain  the  result  that  the  "nearly"  admissible 
spacing  L*  is  generated  by  the  function  f-|(x)  that  minimizes 
the  functional 


(6.41) 


^-00  ^  i(j)f^.[f^(^.^)].[f*(x) 


subject  to  the  subsidiary  condition 

-  CO 

■00 


(6.42)  /     f,(x)dx  =  1 


The  Euler  equation  corresponding  to  this  problem  cannot 
be  solved  explicitly  for  f-,(x),  so  that  an  iterative 
procedure  needs  to  be  developed. 


6.7.   Discussion 

As  is  evident  in  Sections  6.4-6.6,  complete  solution 
of  the  problem  of  admissible  spacing  requires  additional 
investigation  even  in  the  realm  of  theory.   The  author 
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Intends  to  pursue  some  of  these  questions  further  In  the 
future. 

In  addition,  large-scale  numerical  Investigations  are 
In  order.   As  was  pointed  out  In  Section  5-5j  new  numerical 
results  may  continue  to  provide  evidence  In  favor  of  settling 
on  simple  spaclngs,  and  thus  may  obviate  the  necessity  of 
searching  for  optimum  solutions.   (Plausible  criteria  for 
evaluating  a  spacing  {A.}  for  each  shape  j  ^  A  have  been 
defined  in  (6.I6)  and  (6.I9)  above.)   Some  relevant  results 
for  small  sample  sizes  are  given  in  Blrnbaum  and 
Laska  (1967b). 


120 


REFERENCES 

Bennett,  C.  A.  (1952).   Asymptotic  properties  of  ideal 

linear  estimators.   Doctoral  dissertation.  University 

of  Michigan. 

Bernstein,  S.  N.  (1917).   Theory  of  Probability 

(In  Russian) . 

Blrnbaum,  A.  (I96I).   Some  theory  and  techniques  for 

robust  estimation.   (Abstract)   Ann.  Math.  Statist. 

32:  622. 

Blrnbaum,  A.  and  E.  Laska  (1967a).   Optimal  robustness: 

a  general  method,  with  applications  to  linear  estimators 

of  location.   Submitted  to  J.  Amer.  Statist.  Assoc. 

Blrnbaum,  A.  and  E.  Laska  (1967b).   Optimally  robust 

linear  estimators  of  location.   Technical  Report, 

Courant  Institute  of  Mathematical  Sciences,  New  York 

University.   Abstract  to  appear  in  Ann.  Math.  Statist.  {I967) 

Blom,  G.  (1958).   Statistical  Estimates  and  Transformed 

Beta-Variables.   Wiley,  New  York. 

Chernoff,  H.  (I956).   Large  sample  theory:  parametric 

case.   Ann.  Math.  Statist.  27:  1-22. 

Chernoff,  H. ,  J.  L.  Gastwirth,  and  M.  V.  Johns,  Jr. 

(1967).   Asymptotic  distribution  of  linear  combinations 

of  functions  of  order  statistics  with  applications  to 

estimation.   Ann.  Math.  Statist.  38:  52-72. 

Craig,  L.  (I967).   A  property  of  blsymmetrlc  matrices. 

To  be  submitted  for  publication. 


121 


10.  Danlell,  L.  {I92O).   Observations  weighted  according 
to  order.   Amer.  J.  Math.  42:  222-236. 

11.  Elsenberger,  I.  and  E.  C.  Posner  (I965).   Systematic 
statistics  used  for  data  compression  In  space  telemetry, 
J.  Amer.  Statist.  Assoc.  6O:  97-133- 

12.  Elfvlng,  G.  (1966).  Robustness  of  Bayes  decisions 
against  the  choice  of  prior.   Technical  Report  No.  122, 
Department  of  Statistics,  Stanford  University. 

13.  Glrshlck,  M.  and  L.  J.  Savage  (I95I).   Bayes  and 
mlnlmax  estimates  for  quadratic  loss  functions. 
Proc.  Second  Berkeley  Symp.  Math.  Statist.  Prob. 
PP-  53-73,  University  of  California  Press. 

14.  Gupta,  S.  S.  and  M.  Gnanadealkan  (1966).  Estimation 
of  the  parameters  of  the  logistic  distribution. 
Blometrlka  53:  565-570. 

15.  Ha'jek,  J.  (I962).   Asymptotically  most  powerful  rank 
order  tests.   Ann.  Math.  Statist.  35:  1124-1147- 

16.  Hartlgan,  J.  A.  (I965).   The  asymptotically  imblased 
prior  distribution.   Ann.  Math.  Statist.  36:  1137-1152. 

17.  Huber,  P.  (1964).  Robust  estimation  of  a  location 
parameter.   Ann.  Math.  Statist.  35:  73-101. 

18.  Jung,  J.  (1955).   On  linear  estimates  defined  by  a 
continuous  weight  function.  Arklv  for  Matematlk, 
Band  3,  Nr.  I5,  I99-209. 


122 


19.  Karlln,  S.  (1958).   Admissibility  for  estimation  with 
quadratic  loss.   Ann.  Math.  Statist.  29:  406-436. 

20.  Laplace,  P.  S.  de  (l820).   Theorie  Analytique  des 
Probabilite^s  (3rd  edition).  Paris. 

21.  Laska,  E.  (I962).   A  general  theory  of  robustness. 
Doctoral  dissertation.  New  York  University. 

22.  Le  Cam,  L.  (I953).   On  some  asymptotic  properties  of 
maximum  likelihood  estimates  and  related  Bayes'  estimates. 
University  of  California  Publications  in  Statistics 

1:  277-330. 

23.  Le  Cam,  L.  (I958).   Les  propriete's  asymptotiques  des 
solutions  de  Bayes.   Publ.  Inst.  Statist.  Univ.  Paris. 
2:  17-35. 

24.  Lindley,  D.  V.  (I96I).  The  use  of  prior  probability 
distributions  in  statistical  inference  and  decisions. 
Proc.  Third  Berkeley  Symp  .  Math.  Statist.  Prob. 

1:  453-^68. 

25.  Lindley,  D.  V.  (I965).   Introduction  to  Probability 
and  Statistics  from  a  Bayeslan  Viewpoint.   Cambridge 
University  Press. 

26.  Lloyd,  E.  H.  (I952).   Least-squares  estimation  of  location 
and  scale  parameters  using  order  statistics.   Biometrika 

39:  88-95. 

27.  Mises,  R.  von  (I919).   Fundament alsatze  der 
Wahrscheinlichkeitsrechnung.   Math.  Zeit.  4:  1-97- 


123 


28.  Mosteller,  F.  (1946).   On  some  useful  "inefficient" 
statistics.   Ann.  Math.  Statist.  YJ :    377-''^08. 

29.  Ogawa,  J.  (1951).   Contributions  to  the  theory  of 
systematic  statistics,  I.  Osaka  Math.  J.  3:  175-213- 

30.  Pitman,  E.  J.  G.  (1939).   The  estimation  of  location 
and  scale  parameters  of  a  continuous  population  of  any 
form.   Biometrlka  3O:  391-421. 

31.  Rao,  C.  R.  (1965).   Linear  Statistical  Inference  and 
Its  Applications.   Wiley,  New  York. 

32.  Sarhan,  A.  E.  and  B.  G.  Greenberg  (I962).   Contributions 
to  Order  Statistics.   Wiley,  New  York. 

33.  Sarndal,  C.  E.  {I962).   Information  from  Censored 
Samples.   Almqvist  and  Wiksell,  Stockholm. 

34.  Stein,  C.  (I956).  Efficient  nonparametric  testing  and 
estimation.  Proc .  Third  Berkeley  Symp.  Math.  Statist. 
Prob.  1:  187-195. 

35.  Stein,  C.  (1959).   The  admissibility  of  Pitman's 
estimator  of  a  single  location  parameter.   Ann.  Math. 
Statist.  30;  970-979- 

36.  Tischendorf,  J.  A.  (1955).   Linear  estimation  techniques 
using  order  statistics.   Doctoral  dissertation, 

Purdue  University. 

37.  Tukey,  J.  (I96O).   A  survey  of  sampling  from  contaminated 
distributions.   Contributions  to  Probability  and 
Statistics  (I.  Olkln  et  al,  eds.)  pp.  448-485,  Stanford 
University  Press. 


124 


38.  Tukey,  J.  (I962).   The  future  of  data  analysis. 
Ann.  Math.  Statist.   35:  I-67. 

39.  Wllks,  S.  S.  (1963).   Mathematical  Statistics  (2nd 
printing).   Wiley,  New  York. 

40.  Wolfowltz,  J.  (1953).   The  method  of  maximum  likelihood 
and  the  Wald  theory  of  decision  functions.   Indag. 
Math.  15:  114-119. 

41.  Yhap,  E.  P.  (1966).   Asymptotically  admissible  linear 
xinbiased  estimators  (AALUE)  for  robust  estimation  of 
location  and  scale  parameters.   (Abstract)  Ann.  Math. 
Statist.   37:  IO7I-IO72. 

42.  Yhap,  E.  F.  (I967).   An  asymptotically  optimally 

robust  linear  unbiased  estimator  of  location  for  S5rmmetrlc 
shat)es.   Doctoral  dissertation.  New  York  University. 


125 


Security  Classification 


DOCUMENT  CONTROL  DATA  -  R&D 


(Security  ctaaalllcalion  of  title,  body  ot  abst: 


'  and  indexing  annotat. 


be  entered  when 


trail  report  j«  claa allied) 


1.  ORIGINATING  ACTIV/ITY  (Corporate  author) 

Courant  Institute  of  Mathematical  Sciences 
New  York  University 


REPORT   SECURITY     C  L  ASSI  FlC  A  TION 

Not  classified 


26     GROUP 


None 


3     REPORT  TITLE 


Contributions  to  Robust  Estimation 


DESCRIPTIVE  NOTES  (Type  of  report  and  Inctualve  dates) 

Technical  Report   September  196? 


S    AUTHORfS;  (Laat  name.  Ilrat  name.  Initial) 

Mike,  Valerie 


6    REPORT  DATE 


September 


7a-    TOTAL  NO.   OF    PACES  7b.    NO.   OF  REFS 

125  42 


Sa.     CONTRACT    OR    GRANT    NO. 

Nonr-285(58) 

6.  PROJECT  NO.  ^    042-206 


9a.    ORIGINATOR'S   REPORT   NUMBERCS; 


IMM   361 


r  numbers  that  may  be  atalened 


None 


10.  AVAILABILITY/LIMITATION  NOTICES 

Distribution  of  this  document  is  unlimited. 


II    SUPPLEMENTARY  NOTES 


None 


12.  SPONSORING  MILITARY  ACTIVITY 

U.S.  Navy,  Office  of  Naval  Research 
207  West  24th  St.,  New  York,  N.Y. 


13  ABSTRACT 

For  the  problem  of  efficiency-robust  estimation  of  the  location 
parameter  9  of  a  family  of  symmetric  pdf ' s  f(x-9|A),  A e  A =  {1, . . . ,m}, 
ee0  =  ?e|-oo  <9<oo},  the  method  of  "mixture  models"  of  Birnbaum 
(Ann.  Math.  Statist.  32  (I961)  622)  is  applied  to  determine 
generalized  Pitman  estimators,  which  are  shown  to  be  admissible 
squared  error  loss  function,  under  broad  regularity  conditions, 
increasing  sample  size,  these  estimators  are  proved  to  be  fully 
efficient  (i.e.,  asymptotically  equivalent,  for  each  value  of  A 
the  maximum  likelihood  estimator  which  would  be  appropriate  if  the 
true  value  of  A  were  known).   Computationally  tractable  analogous 
estimators  based  on  k  sample  quantlles  are  defined  in  the  context  of 
the  model  representing  their  asymptotic  normal  distributions.   It  is 
shown  that  with  increasing  k  these  approach  equivalence  to  the  fully 
efficient  estimators  based  on  complete  samples.   Equivalent  estimators 
are  given  also  for  the  case  of  unknown  scale  parameters.   Efficiency- 
robust  linear  unbiased  estimators  based  on  sample  quantlles  are 
derived  and  the  optimal  spacing  of  quantlles  is  discussed. 


with 
With 


to 


DD  /.?N«'J.  1473 


126 


Security  Classification 


Security  Classification 


KEY  WORDS 


INSTRUCTIONS 

\.    ORIGINATING  ACTIVITY:    Enter  the  name  and  address 
of  the  contractor,  subcontractor,  grantee.  Department  of  De- 
fense activity  or  other  organization  (corporate  author)  issuing 
the  report. 

2a.    REPORT  SECUFITV  CLASSIFICATION:    Enter  the  over- 
all security  classification  of  the  report.    Indicate  whether 
"Restricted  Data"  is  included.    Marking  is  to  be  in  accord- 
ance with  appropriate  security  regulations. 
26.    GROUP:    Automatic  downgrading  is  specified  in  DoD  Di- 
rective 5200.10  and  Armed  Forces  Industrial  Manual.    Enter 
the  group  number.     Also,  when  applicable,  show  that  optional 
markings  have  been  used  for  Group  3  and  Group  4  as  author- 
ized. 

3.  REPORT  TITLE:  Enter  the  complete  report  title  in  all 
capital  letters.  Titles  in  all  cases  should  be  unclassified. 
If  a  meaningful  title  cannot  be  selected  without  classifica- 
tion, show  title  classification  in  all  capitals  in  parenthesis 
immediately  following  the  title. 

4.  DESCRIPTIVE  NOTES:    If  ^propriate,  enter  the  type  of 
report,  e.g.,  interim,  progress,  summary,  annual,  or  final. 
Give  the  inclusive  dates  when  a  specific  reporting  period  is 
covered. 


5.  AUTHOR(S):     Enter  the  name(s)  of  authoKs)  as  shown  on 
or  in  the  report.     Entei  last  name,  first  name,  middle  initial. 
If  military,  show  rank  and  branch  of  service.     The  name  of 
the  principal  author  is  an  absolute  minimum  requirement. 

6.  REPORT  DATE:     Enter  the  date  of  the  report  as  day, 
month,  year,  or  month,  year.     If  more  than  one  date  appears 
on  the  report,  use  date  of  publication. 

7a.    TOTAL  NUMBER  OF  PAGES:    The  total  page  count 
should  follow  normal  pagination  procedures,  ue.,  enter  the 
number  of  pages  containing  information. 

76.     NUMBER  OF  REFERENCES:     Enter  the  total  number  of 
references  cited  in  the  report. 

8a.     CONTRACT  OR  GRANT  NUMBER:     If  appropriate,  enter 
the  applicable  number  of  the  contract  or  grant  under  which 
the  report  was  written. 

86,  8c,  &  8d.    PROJECT  NUMBER:     Enter  the  appropriate 
military  department  identification,  such  as  project  number, 
subproject  number,  system  numbers,  task  number,  etc. 
9a.    ORIGINATOR'S  REPORT  NUMBER(S):     Enter  the  offi- 
cial report  number  by  which  the  document  will  be  identified 
and  controlled  by  the  originating  activity.     This  number  must 
be  unique  to  this  report. 

96.    OTHER  REPORT  NUMBER(S):     If  the  report  has  been 
assigned  any  other  report  numbers  (either  by  the  originator 
or  by  the  sponsor),  also  enter  this  number(s). 
10.    AVAILABILITY/LIMITATION  NOTICES:    Enter  any  lim- 
itations on  further  dissemination  of  the  report,  other  than  those 


(1) 


(2) 


(3) 


(4) 


"Qualified  requesters  may  obtain  copies  of  this 

report  from  DDC." 

"Foreign  announcement  and  dissemination  of  this 

report  by  DDC  is  not  authorized. " 

"U.  S.  Government  agencies  may  obtain  copies  of 

this  report  directly  from  DDC.    Other  qualified  DDC 

users  shall  request  through 


"U.  S.  military  agencies  may  obtain  copies  of  this 
report  directly  from  DDC    Other  qualified  users 
shall  request  through 


"All  distribution  of  this  report  is  controlIe<l    ^al- 
ified  DDC  users  shall  request  through 


If  the  report  has  been  furnished  to  the  Office  of  Technical 
Services,  Department  of  Commerce,  for  sale  to  the  public,  indi- 
cate this  fact  and  enter  the  price,  if  knowiv 
lU    SUPPLEMENTARY  NOTES:    Use  for  additional  explana- 
tory notes. 

12.  SPONSORING  MILITARY  ACTIVITY:  Enter  the  name  of 
the  departmental  project  office  or  laboratoiry  sponsoring  (pay- 
ing for)  the  research  and  development.    Include  address. 

13.  ABSTRACT:    Enter  an  abstract  giving  a  brief  and  factual 
summary  of  the  document  indicative  of  the  report,  even  though 
it  may  also  appear  elsewhere  in  the  body  of  the  technical  re- 
port.    If  additional  space  is  required,  a  continuation  sheet  shall 
be  attached. 

It  is  highly  desirable  that  the  abstract  of  classified  reports 
be  unclassified.     Each  paragraph  of  the  abstract  shall  end  with 
an  indication  of  the  military  security  classification  of  the  in- 
formation in  the  paragraph,  represented  as  (TS).  (S).  (C).  or  (V). 

There  is  no  limitation  on  the  length  of  the  abstract.     How- 
ever, the  suggested  length  is  from  150  to  225  words. 

14.  KEY  WORDS:    Key  words  are  technically  meaningful  terms 
or  short  phrases  that  characterize  a  report  and  may  be  used  as 
index  entries  for  cataloging  the  report.     Key  words  must  be 
selected  so  that  no  security  classification  is  required.     Identi- 
fiers, such  as  equipment  model  designation,  trade  name,  military 
project  code  name,  geographic  location,  may  be  used  as  key 
words  but  will  be  followed  by  an  indication  of  technical  con- 
text.   The  assignment  of  links,  roles,  and  weights  is  optional. 


127 


Security  Classification 


Nonr  285(38) 

BASIC  DISTRIBUTION  LIST  FOR  UNCLASSIFIED  TECHNICAL  REPORT: 


Head,  Logistics  and  Mathematical  3  copies 

Statistics  Branch 
Office  of  Naval  Research 
Washington,  D.C.  20^60 

Commanding  Officer  2  copies 

Office  of  Naval  Research  Branch  Office 
.-Javy  /'lOO  Fleet  Post  Office 
New  York,  New  York 


Defense  Documentation  Center  20  copies 

Cameron  Station 
Alexandria,  Virginia  2231^^ 

Defense  Logistics  Studies  1  copy 

Information  Exchange 
Army  Logistics  Management  Center 
Fort  Lee,  Virginia 
Attn:   William  B.  Whichard 

Technical  Information  Officer  6  copies 

Naval  Research  Laboratory 
Washington,  D.C.  20390 

Commanding  Officer  1  copy 

Office  of  Naval  Research  Branch. Office 
207  West  24th  Street 
New  York,  New  York  10011 
Attn:  J.  Laderman 

Commanding  Officer  1  copy 

Office  of  Naval  Research  Branch  Office 
1030  East  Green  Street 
Pasadena  1,  California 
Attn:   Dr.  A.R.  Laufer 

Bureau  of  Supplies  and  Accounts  1  copy 

Code  OW 

Department  of  the  Navy 

Washington  25,  D.C. 

Institute  for  Defense  Analyses  1  copy 

Communications  Research  Division 
von  Neumann  Hall 
Princeton,  New  Jersey 

University  of  Chicago  1  copy 

Statistical  Research  Center 
Chicago,  Illinois 
Attn:  Prof.  Paul  Meier 


128 


Nonr  285(38) 


Stanford  University  Columbia  University 

Applied  Mathematics  &  Statistics  Lab,  Dept.  of  Industrial  Engineering 

Stanford,  California  New  York  27,  New  York 

Attn:  Prof.  Gerald  J.  Lieberman       Attn:  Prof.  Cyrus  Derman 


Florida  State  University 
Department  of  Statistics 
Tallahassee,  Florida 
Attn:  Prof.  I.R.   Savage 


Columbia  University 
Department  of  Mathematics 
New  York  27,  New  York 
Attn:  Prof.  H.  Bobbins 


Florida  State  University 
Department  of  Statistics 
Tallahassee,  Florida 
Attn:   Dr.  Ralph  A.  Bradley 

Princeton  University 
Department  of  Mathematics 
Princeton,  New  Jersey 

Columbia  University 

Department  of  Mathematical  Statistics 

New  York  27,  New  York 

Attn:   Prof.  T.W.  Anderson 

University  of  California 
Department  of  Statistics 
Berkeley  1,  California 
Attn:   Prof.  J.  Neyman 

University  of  Washington 
Department  of  Mathematics 
Seattle  5,  Washington 
Attn:   Prof.  Z.W.  Birnbaum 

Cornell  University 
Department  of  Mathematics 
Ithaca,  New  York 
Attn:   Prof.  J.  Wolfowitz 

Harvard  University 
Department  of  Statistics 
Cambridge,  Massachusetts 
Attn:   Prof.  W.G.  Cochran 

The  Research  Triangle  Institute 
Statistics  Research  Division 
505  West  Chapel  Hill  Street 
Durham,  North  Carolina 
Attn:   Dr.  Malcolm  R.  Leadbetter 


New  York  University 

Courant  Inst,  of  Mathematical  Sciences 

New  York   ,  New  York   10012 

Attn:  Prof.  W.M.  Hirsch 

Cornell  University 

Department  of  Plant  Breeding 

Biometrics  Unit 

Ithaca,  New  York 

Attn:   Walter  T.  Federer 

University  of  North  Carolina 
Statistics  Department 
Chapel  Hill,  North  Carolina 
Attn:   Prof.  Walter  L.  Smith 

Stanford  University 
Depai'tment  of  Statistics 
Stanford,  California 
Attn:   Prof.  Herman  Rubin 

Brown  University 
Division  of  Applied  Mathematics 
Providence  12,  Rhode  Island 
Attn:  Prof.  M.  Rosenblatt 

New  York  University 

Department  of  Industrial  Engineering 

and  Operations  Research 
Bronx  63,  New  York 
Attn:   Prof.  J.  H.  Kao 

University  of  Wisconsin 
Department  of  Statistics 
Madison,  Wisconsin 
Attn:   Prof.  G.E.P.  Box 


129 


Nonr  285(38) 

Prof.  Anders  Hald 

Institute  of  Mathematical  Statistics 

University  of  Copenhagen 

Copenhagen,  Denmark 


Prof.  Harold  E.  Dodge 

Rutgers  -  The  State  University 

Statistics  Center 

New  Brunswick,  N^w  Jersey 


Yale  University 
Department  of  Mathematics 
New  Haven,  Connecticut 
Attn:  Prof.  L.J.  Savage 


Yale  University 
Department  of  Statistics 
New  Haven,  Connecticut 
Attn:   Prof.  F.J.  Anscombe 


Massachusetts  Institute  of  Technology  Purdue  University 

Department  of  Electrical  Engineering  Division  of  Mathematical  Sciences 

Cambridge,  Massachusetts  Lafayette,  Indiana 

Attn:  Dr.  R.A.  Howard  Attn:   Prof.  S.S.  Gupta 

Cornell  University 

Department  of  Industrial  Engineering 

Ithaca,  New  York 

Attn:  Prof.  Robert  Bechhofer 


Stanford  University 
Department  of  Statistics 
Stanford,  California 
Attn:   Prof.  E.  Parzen 


Stanford  University 
Department  of  Statistics 
Stanford,  California 
Attn:  Prof.  C.  Stein 


Arcon  Corporation 
803  Massachusetts  Avenue 
Lexington  73,  Massachusetts 
Attn:   Dr.  Arthur  Albert 

University  of  California 
Institute  of  Engineering  Research 
Berkeley  >^ ,    California 
Attn:   Prof.  R.E.  Barlow 

Michigan  State  University 
Department  of  Statistics 
East  Lansing,  Michigan 
Attn:   Prof.  J.  Gani 

Dr.  Jack  R.  Bor sting 

Chairman 

Dept.  of  Operations  Analysis 

U.  S.  Naval  Postgraduate  Sch. 

Monterey,  California  959^0 


Applied  Mathematics  and  Statistics  Lab 
Department  of  Stati  tics 
Stanford  University 
Stanford,  California 
Attn:   Prof.  H.  Solomon 

Decision  Studies  Group 
^60  California  Avenue 
Palo  Alto,  California 
Attn:   Warren  R.  Ketler 

The  Johns  Hopkins  University- 
Department  of  Mathematical  Statistics 
34th  and  Charles  Streets 
Baltimore  I8,  Maryland 
Att:   Prof.  Geoffrey  S.  Watson 


130 


/  NYU 
IMM- 
361 


c.l 


Mike 


Contributions  to  robust 
estimation. 


NYU 
IMM- 


C.l 


Mike 


nontrlbutions  to  robust 
^Rt.imation. 


N.Y.U.  Courant  Institute  of 
Mathematical  Sciences 

251  Mercer  St. 
New  York,  N.  Y.  10012 


DATE  DUE 

J10V      -■:.!, 

. 

-^  Sim 

jiHi»- 

—-'^-'Vttf 

C.V.OHO 

""""'~"  =  " 

